
NUCLEIC ACIDS AND PROTEINS FROM STREPTOCOCCUS PNEUMONIAE 



fctt^The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 
countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the most 
common bacterial cause of acute respiratory infections, and is estimated to result in 1 
million childhood deaths each year, mostly in developing countries (Stansfield, S.K., 
Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been suggested (Breiman et al, 
Arch. Intern. Med., 150: 1401 (1990)) that the pneumococcus is still the most common 
cause of bacterial pneumonia, and that disease rates are particularly high in young 
children, in the elderly, and in patients with predisposing conditions such as asplenia, 
heart, lung and kidney disease, diabetes, alcoholism, or with immunosupressive 
disorders, especially AIDS. These groups are at higher risk of pneumococcal 
septicaemia and hence meningitis and therefore have a greater risk of dying from 
pneumococcal infection. The pneumococcus is also the leading cause of otitis media 
and sinusitis, which remain prevalent infections in children in developed countries, 
and which incur substantial costs. 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 
to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics of 



2 



the Centres for Disease Control/National Centre for Health Statistics, 214:1 (1992)). 
The rates of penicillin resistance can be higher (up to 20%) in some hospitals 
(Breiman et al 9 J. Am. Med. Assoc., 271: 1831 (1994)). Since the development of 
penicillin resistance among pneumococci is both recent and sudden, coming after 
5 decades during which penicillin remained an effective treatment, these findings are 
regarded as alarming. 

For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
10 pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention of 
pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 

15 surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases. An additional problem arises because it has been found that the capsular 
polysaccharides (each of which determines the serotype and is the major protective 

20 antigen) when purified and used as a vaccine do not reliably induce protective 

antibody responses in children under two years of age, the age group which suffers the 
highest incidence of invasive pneumococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
25 polysaccharide to a protein in order to derive an enhanced immune response, 

particularly by giving the response T-cell dependent character. This approach has been 
used in the development of a vaccine against Haemophilus influenzae, for instance. 
There are, however, issues of cost concerning both the multi-polysaccharide vaccines 
and those based on conjugates. 

30 
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A third approach is to look for other antigenic components which offer the potential to 
be vaccine candidates. This is the basis of the present invention. Using a specially 
developed bacterial expression system, we have been able to identify a group of 
protein antigens from pneomococcus which are associated with the bacterial envelope 
5 or which are secreted. 

Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 1. 

10 In a second aspect, the present invention provides a Streptococcus pneumoniae protein 
or polypeptide having a sequence selected from those shown in table 2. 

A protein or polypeptide of the present invention may be provided in substantially pure 
form. For example, it may be provided in a form which is substantially free of other 
15 proteins. 

As discussed herein, the proteins and polypeptides of the invention are useful as antigenic 
material. Such material can be "antigenic" and/or "immunogenic". Generally, "antigenic" 
is taken to mean that the protein or polypeptide is capable of being used to raise 
20 antibodies or indeed is capable of inducing an antibody response in a subject. 
"Immunogenic" is taken to mean that the protein or polypeptide is capable of eliciting a 
protective immune response in a subject. Thus, in the latter case, the protein or 
polypeptide may be capable of not only generating an antibody response but, in addition, 
a non-antibody based immune response. 

25 

The skilled person will appreciate that homologues or derivatives of the proteins or 
polypeptides of the invention will also find use in the context of the present invention, ie 
as antigenic/immunogenic material. Thus, for instance proteins or polypeptides which 
include one or more additions, deletions, substitutions or the like are encompassed by the 
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present invention. In addition, it may be possible to replace one amino acid with another 
of similar "type". For instance replacing one hydrophobic amino acid with another. 
One can use a program such as the CLUSTAL program to compare amino acid 
sequences. This program compares amino acid sequences and finds the optimal 
5 alignment by inserting spaces in either sequence as appropriate. It is possible to calculate 
amino acid identity or similarity (identity plus conservation of amino acid type) for an 
optimal alignment. A program like BLASTx will align the longest stretch of similar 
sequences and assign a value to the fit. It is thus possible to obtain a comparison where 
several regions of similarity are found, each having a different score. Both types of 
10 identity analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
polypeptide as described herein is less important than that the homologue or derivative 
should retain the antigenicity or immunogenicity of the original protein or polypeptide. 
15 However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
Preferably, homologues or derivatives having at least 70% similarity, more preferably at 
least 80% similarity are provided. Most preferably, homologues or derivatives having at 
least 90% or even 95% similarity are provided. 

20 

In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or it 
may be the case that the fusion protein itself retains sufficient antigenicity to be useful. 

25 

In an additional aspect of the invention there are provided antigenic/immunogenic 
fragments of the proteins or polypeptides of the invention, or of homologues or 
derivatives thereof. 
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For fragments of the proteins or polypeptides described herein, or of homologies or 
derivatives thereof, the situation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity. 
5 Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
10 polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

Thus, what is important for homologues, derivatives and fragments is that they possess at 
least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
1 5 which they are derived. 

Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook et 
al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). Thus, 
20 in a third aspect, the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

25 (ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 



(iv) a sequence which has substantial identity with any of those of (i), (ii) and 
(iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
5 protein as defined in Table 1. 

In a fourth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

1 0 (i) any of the DNA sequences set out in Table 2 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
1 5 sequences of (i) or (ii); 

(iv) a sequence which has substantial identity with any of those of (i), (ii) and 
(iii); or 

20 (v) a sequence which codes for a homologue, derivative or fragment of a 

protein as defined in Table 2. 

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 

25 include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein. Such variants are encompassed by the present invention. These may 
occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 

30 making use of known preferred codon usage in the particular organism being used for 
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expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

The term "RNA equivalent" when used above indicates that a given RNA molecule has a 
5 sequence which is complementary to that of a given DNA molecule (allowing for the fact 
that in RNA "U" replaces "T" in the genetic code). 

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFTT and GAP (both from the 

10 Wisconsin Genetics Computer Group (GCG) software package) BESTFTT, for example, 
compares two sequences and produces an optimal alignment of the most similar 
segments. GAP enables sequences to be aligned along their whole length and finds the 
optimal alignment by inserting spaces in either sequence as appropriate. Suitably, in the 
context of the present invention when discussing identity of nucleic acid sequences, the 

15 comparison is made by alignment of the sequences along their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at least 
95% sequence identity with said sequences. In some cases the sequence identity may be 
20 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater degree 
of identity with any of the sequences described herein than with prior art nucleic acid 
sequences. 

25 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 
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The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such 
vectors and suitable hosts form yet further aspects of the present invention. 

5 Therefore, for example, by using probes based upon the nucleic acid sequences provided 
herein, genes in Streptococcus pneumoniae can be identified. They can then be excised 
using restriction enzymes and cloned into a vector. The vector can be introduced into a 
suitable host for expression. 

10 Nucleic acid molecules of the present invention may be obtained from S.pneumoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic acid 
molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

15 Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in PCR 
so that a desired sequence, including whole genes or fragments thereof, can be targeted 
and then amplified to a high degree. 

20 Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to provide 
a longer sequence. 

25 

There is another group of proteins from S.pneumoniae which have been identified 
using the bacterial expression system described herein. These are known proteins from 
S.pneumoniae, which have not previously been identified as antigenic proteins. The 
amino acid sequences of this group of proteins, together with DNA sequences coding 
30 for them are shown in Table 3. These proteins, or homologues, derivatives and/or 
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fragments thereof also find use as antigens/immunogens. Thus, in another aspect the 
present invention provides the use of a protein or polypeptide having a sequence 
selected from those shown in Tables 1-3, or homologues, derivatives and/or fragments 
thereof, as an immunogen/antigen. 

5 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 1-3, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
10 immunogenic/antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 

15 It is also possible to utilise the nucleic acid sequences shown in Tables 1-3 in the 

preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. DNA 
vaccines are described in the art (see for instance, Donnelly et al , Ann. Rev. 
Immunol., 15:617-648 (1997)) and the skilled person can use such art described 

20 techniques to produce and use DNA vaccines according to the present invention. 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 
of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection of 

25 antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 
Suitably, the sample is a biological sample, such as a tissue sample or a sample of 

30 blood or saliva obtained from a subject to be tested. 
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In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used to 
detect the antigens, and hence S. pneumoniae. Such antibodies form another aspect of 
5 the invention. Antibodies within the scope of the present invention may be monoclonal 
or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
10 described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique (Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

20 

Techniques for producing monoclonal and polyclonal antibodies that bind to a particular 
polypeptide/protein are now well developed in the art. They are discussed in standard 
immunology textbooks, for example in Roitt etaU Immunology second edition (1989), 
Churchill Livingstone, London. 

25 

In addition to whole antibodies, the present invention includes derivatives thereof which 
are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
30 1994). 
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Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments. Fab fragments 
(These are discussed in Roitt et al [supra] ). Fv fragments can be modified to produce a 
synthetic construct known as a single chain Fv (scFv) molecule. This includes a peptide 
5 linker covalently joining Vh and Vi regions, which contributes to the stability of the 
molecule. Other synthetic constructs that can be used include CDR peptides. These are 
synthetic peptides comprising antigen-binding determinants. Peptide mimetics may also 
be used. These molecules are usually conformationally restricted organic rings that 
mimic the structure of a CDR loop and that include antigen-interactive side chains. 

10 

Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human framework 
regions, but rodent hypervariable regions. Ways of producing chimaeric antibodies are 
15 discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) and by Takeda 
et al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For 
20 example the moiety may be a label (e.g. a fluorescent or radioactive label). Alternatively, 
it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. Thus, 
in another aspect the present invention provides a method for the detection/diagnosis of 
25 S.pneumoniae which comprises the step of bringing into contact a sample to be tested 
and antibodies capable of binding to one or more proteins described herein, or to 
homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins 
30 selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
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(Nord etal,) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

It will also be clear that the nucleic acid sequences described herein may be used to 
5 detect/diagnose S. pneumoniae. Thus, in yet a further aspect, the present invention 

provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. Such 
10 samples may be pre-treated before being used in the methods of the invention. Thus, 

for example, a sample may be treated to extract DNA. Then, DNA probes based on the 
nucleic acid sequences described herein (ie usually fragments of such sequences) may 
be used to detect nucleic acid from S. pneumoniae. 

15 In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against 5. pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 

20 invention; 

(b) a method of vaccinating a subject against S. pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

25 (c) a method for the prophylaxis or treatment of S. pneumoniae infection which 
comprises the step of administering to a subject a protein or polypeptide of the 
invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

30 (d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
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comprises the step of administering to a subject a nucleic acid molecule as defined 
herein; 

(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one or 
5 more proteins or polypeptides of the invention, or homologues, derivatives or 

fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one or 
more nucleic acid molecules as defined herein. 

10 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or 
15 polypeptide as described herein represents a potential anti-microbial target which 
comprises antagonising, inhibiting or otherwise interfering with the function or 
expression of said protein and determining whether S. pneumoniae is still viable. 

A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
20 prevent expression of the protein and determine whether this results in a lethal change. 
Suitable methods for carrying out such gene knockouts are described in Li et al , 
P.N.A.S., 94:13251-13256 (1997) and Kolkman et al , 178:3736-3741 
(1996). 

25 In a final aspect the present invention provides the use of an agent capable of 

antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of 5. pneumoniae infection. 



30 



As mentioned above, we have used a bacterial expression system as a means of 
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identifying those proteins which are surface associated, secreted or exported and thus, 
would find use as antigens. 

The information necessary for the secretion/export of proteins has been extensively 
5 studied in bacteria. In the majority of cases, protein export requires a signal peptide to 
be present at the N-terminus of the precursor protein so that it becomes directed to the 
translocation machinery on the cytoplasmic membrane. During or after translocation, 
the signal peptide is removed by a membrane associated signal peptidase. Ultimately 
the localization of the protein (i.e. whether it be secreted, an integral membrane 
10 protein or attached to the cell wall) is determined by sequences other than the leader 
peptide itself. 

We are specifically interested in surface located or exported proteins as these are 
likely to be antigens for use in vaccines, as diagnostic reagents or as targets for 

15 therapy with novel chemical entities. We have therefore developed a screening vector- 
system in Lactococcus lactis that permits genes encoding exported proteins to be 
identified and isolated. We provide below a representative example showing how 
given novel surface associated proteins from Streptococcus pneumoniae have been 
identified and characterized. The screening vector incorporates the staphylococcal 

20 nuclease gene nuc lacking its own export signal as a secretion reporter. Staphylococcal 
nuclease is a naturally secreted heat-stable, monomeric enzyme which has been 
efficiently expressed and secreted in a range of Gram positive bacteria (Shortle, Gene, 
22:181-189 (1983); Kovacevic et al. 9 J. BacterioL, 162:521-528 (1985); Miller et aL y 
J. BacterioL, 169:3508-3514 (1987); Liebl et ai y J. BacterioL, 174:1854-1861 (1992); 

25 Le Loir et aL, J. BacterioL, 176:5135-5139 (1994); Poquet et aL, J. BacterioL, 
180:1904-1912(1998)). 

Recently, Poquet et al. ((1998), supra) have described a screening vector 
incorporating the nuc gene lacking its own signal leader as a reporter to identify 
30 exported proteins in Gram positive bacteria, and have applied it to L. lactis. This 
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vector (pFUN) contains the pAM|3l replicon which functions in a broad host range of 
Gram-positive bacteria in addition to the ColEl replicon that promotes replication in 
Escherichia coli and certain other Gram negative bacteria. Unique cloning sites 
present in the vector can be used to generate transcriptional and translational fusions 
5 between cloned genomic DNA fragments and the open reading frame of the truncated 
nuc gene devoid of its own signal secretion leader. The nuc gene makes an ideal 
reporter gene because the secretion of nuclease can readily be detected using a simple 
and sensitive plate test: Recombinant colonies secreting the nuclease develop a pink 
halo whereas control colonies remain white (Shortle, (1983), supra; Le Loir et aL, 
10 (1994), supra). 

Thus, the invention will now be described with reference to the following 
representative example, which provides details of how the proteins, polypeptides and 
nucleic acid sequences described herein identified as antigenic targets. 

15 

We describe herein the construction of three reporter vectors and their use in L. lactis 
to identify and isolate genomic DNA fragments from Streptococcus pneumoniae 
encoding secreted or surface associated proteins. 

The invention will now be described with reference to the examples, which should not 
20 be construed as in any way limiting the invention. The examples refer to the figures in 
which: 

Figure 1: shows the results of a number of DNA vaccine trials; and 
25 Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

(i) Construction of the pTREPl-nuc series of reporter vectors 

30 
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(a) Construction of expression plasmid pTREPl 

The pTREPl plasmid is a high-copy number (40-80 per cell) theta-replicating gram 
positive plasmid, which is a derivative of the pTREX plasmid which is itself a 
5 derivative of the previously published pIL253 plasmid. pIL253 incorporates the broad 
Gram-positive host range replicon of pAMpl (Simon and Chopin, Biochimie, 70:559- 
567 (1988)) and is non-mobilisable by the L lactis sex-factor. pIL253 also lacks the tra 
function which is necessary for transfer or efficient mobilisation by conjugative parent 
plasmids exemplified by pIL501. The Enterococcal pAMpl replicon has previously 

10 been transferred to various species including Streptococcus, Lactobacillus and 

Bacillus species as well as Clostridium acetobutylicum, (Oultram and Klaenhammer, 
FEMS Microbiological Letters, 27:129-134 (1985); Gibson et aU (1979); LeBlanc et 
al., Proceedings of the National Academy of Science USA, 75:3484-3487 (1978)) 
indicating the potential broad host range utility. The pTREP 1 plasmid represents a 

15 constitutive transcription vector. 

The pTREX vector was constructed as follows. An artificial DNA fragment containing 
a putative RNA stabilising sequence, a translation initiation region (TIR), a multiple 
cloning site for insertion of the target genes and a transcription terminator was created 

20 by annealing 2 complementary oligonucleotides and extending with Tfl DNA 

polymerase. The sense and anti-sense oligonucleotides contained the recognition sites 
for Nhel and BamHI at their 5' ends respectively to facilitate cloning. This fragment 
was cloned between the Xbal and BamHI sites in pUC19NT7, a derivative of pUC19 
which contains the T7 expression cassette from pLETl (Wells et al , J. Appl. 

25 BacterioL, 74:629-636 (1993)) cloned between the EcoRI and Hindm sites. The 
resulting construct was designated pUCLEX. The complete expression cassette of 
pUCLEX was then removed by cutting with Hindlll and blunting followed by cutting 
with EcoRI before cloning into EcoRI and SacI (blunted) sites of pEL253 to generate 
the vector pTREX (Wells and Schofield, In Current advances in metabolism, genetics 

30 and applications-NATO ASI Series, H 98:37-62 (1996)). The putative RNA stabilising 
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sequence and TIR are derived from the Escherichia coli T7 bacteriophage sequence 
and modified at one nucleotide position to enhance the complementarity of the Shine 
Dalgarno (SD) motif to the ribosomal 16s RNA of Lactococcus lactis (Schofield et al. 
pers. corns. University of Cambridge Dept. Pathology.). 

5 

A Lactococcus lactis MG1363 chromosomal DNA fragment exhibiting promoter 
activity which was subsequently designated P7 was cloned between the EcoRI and 
Bglll sites present in the expression cassette, creating pTREX7. This active promoter 
region had been previously isolated using the promoter probe vector pSB292 
10 (Waterfield et al 9 Gene, 165:9-15 (1995)). The promoter fragment was amplified by 
PCR using the Vent DNA polymerase according to the manufacturer. 

The pTREPl vector was then constructed as follows. An artificial DNA fragment 
which included a transcription terminator, the forward pUC sequencing primer, a 

15 promoter multiple -cloning site region and a universal translation stop sequence was 
created by annealing two overlapping partially complementary synthetic 
oligonucleotides together and extending with sequenase according to manufacturers 
instructions. The sense and anti-sense (pTREPF and pTREPR) oligonucleotides 
contained the recognition sites for EcoRV and BamHI at their 5' ends respectively to 

20 facilitate cloning into pTREX7. The transcription terminator was that of the Bacillus 
penicillinase gene, which has been shown to be effective in Lactococcus (Jos et aL y 
Applied and Environmental Microbiology, 50:540-542 (1985)). This was considered 
necessary as expression of target genes in the pTREX vectors was observed to be 
leaky and is thought to be the result of cryptic promoter activity in the origin region 

25 (Schofield et aL pers. corns. University of Cambridge Dept. Pathology.). The forward 
pUC primer sequencing was included to enable direct sequencing of cloned DNA 
fragments. The translation stop sequence which encodes a stop codon in 3 different 
frames was included to prevent translational fusions between vector genes and cloned 
DNA fragments. The pTREX7 vector was first digested with EcoRI and blunted using 

30 the 5' - 3* polymerase activity of T4 DNA polymerase (NEB) according to 
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manufacturer's instructions. The EcoRI digested and blunt ended pTREX7 vector was 
then digested with Bgl II thus removing the P7 promoter. The artificial DNA fragment 
derived from the annealed synthetic oligonucleotides was then digested with EcoRV 
and Bam HI and cloned into the EcoRI(blunted)-Bgl II digested pTREX7 vector to 
5 generate pTREP. A Lactococcus lactis MG1363 chromosomal promoter designated PI 
was then cloned between the EcoRI and Bglll sites present in the pTREP expression 
cassette forming pTREPl. This promoter was also isolated using the promoter probe 
vector pSB292 and characterised by Waterfield et aL, (1995), supra. The PI 
promoter fragment was originally amplified by PCR using vent DNA polymerase 
10 according to manufacturers instructions and cloned into the pTREX as an EcoRI-Bglll 
DNA fragment. The EcoRI-BglH PI promoter containing fragment was removed from 
pTREXl by restriction enzyme digestion and used for cloning into pTREP (Schofield 
et aL pers. corns. University of Cambridge, Dept. Pathology.). 

1 5 (b) PCR amplification of the 5. aureus nuc gene . 

The nucleotide sequence of the S. aureus nuc gene (EMBL database accession number 
V01 281) was used to design synthetic oligonucleotide primers for PCR amplification. 
The primers were designed to amplify the mature form of the nuc gene designated 

20 nucA which is generated by proteolytic cleavage of the N-terminal 19 to 21 amino 
acids of the secreted propeptide designated Snase B (Shortle, (1983), supra). Three 
sense primers (nucSl, nucS2 and nucS3, Appendix 1) were designed, each one having 
a blunt-ended restriction endonuclease cleavage site for EcoRV or Smal in a different 
reading frame with respect to the nuc gene. Additionally Bglll and BamHI were 

25 incorporated at the 5' ends of the sense and anti-sense primers respectively to facilitate 
cloning into BamHI and Bglll cut pTREPl. The sequences of all the primers are given 
in Appendix 1 . Three nuc gene DNA fragments encoding the mature form of the 
nuclease gene (NucA) were amplified by PCR using each of the sense primers 
combined with the anti-sense primer described above. The nuc gene fragments were 

30 amplified by PCR using 5. aureus genomic DNA template, Vent DNA Polymerase 



19 

(NEB) and the conditions recommended by the manufacturer. An initial denaturation 

step at 93 °C for 2 min was followed by 30 cycles of denaturation at 93 °C for 45 sec, 

annealing at 50 °C for 45 seconds, and extension at 73 °C for 1 minute and then a 

final 5 min extension step at 73 °C. The PCR amplified products were purified using a 
5 Wizard clean up column (Promega) to remove unincorporated nucleotides and 
primers. 

(c) Construction of the pTREPl-nuc vectors 

10 The purified nuc gene fragments described in section b were digested with Bgl II and 
BamHI using standard conditions and ligated to BamHI and Bglll cut and 
dephosphorylated pTREPl to generate the pTREPl-nucl, pTREPl-nuc2 and 
pTREPl-nuc3 series of reporter vectors. General molecular biology techniques were 
carried out using the reagents and buffer supplied by the manufacture or using 

15 standard conditions(Sambrook and Maniatis, (1989), supra). In each of the pTREPl- 
nuc vectors the expression cassette comprises a transcription terminator, lactococcal 
promoter PI, unique cloning sites (Bglll, EcoRV or Smal) followed by the mature 
form of the nuc gene and a second transcription terminator. Note that the sequences 
required for translation and secretion of the nuc gene were deliberately excluded in 

20 this construction. Such elements can only be provided by appropriately digested 

foreign DNA fragments (representing the target bacterium) which can be cloned into 
the unique restriction sites present immediately upstream of the nuc gene. 

In possessing a promoter, the pTREPl-nuc vectors differ from the pFUN vector 
25 described by Poquet et aL (1998), supra, which was used to identify L. lactis exported 
proteins by screening directly for Nuc activity directly in L. lactis. As the pFUN 
vector does not contain a promoter upstream of the nuc open reading frame the cloned 
genomic DNA fragment must also provide the signals for transcription in addition to 
those elements required for translation initiation and secretion of Nuc. This limitation 
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may prevent the isolation of genes that are distant from a promoter for example genes 
which are within polycistronic operons. Additionally there can be no guarantee that 
promoters derived from other species of bacteria will be recognised and functional in 
L. lactis. Certain promoters may be under stringent regulation in the natural host but 
5 not in L. lactis. In contrast, the presence of the PI promoter in the pTREPl-nuc series 
of vectors ensures that promoterless DNA fragments (or DNA fragments containing 
promoter sequences not active in L. lactis) will still be transcribed. 

(d) Screening for secreted proteins in S. pneumoniae 

10 

Genomic DNA isolated from S. pneumoniae was digested with the restriction enzyme 
Tru9I. This enzyme which recognises the sequence 5'- TTAA -3' was used because it 
cuts A/T rich genomes efficiently and can generate random genomic DNA fragments 
within the preferred size range (usually averaging 0.5 - 1.0 kb). This size range was 

15 preferred because there is an increased probability that the PI promoter can be utilised 
to transcribe a novel gene sequence. However, the PI promoter may not be necessary 
in all cases as it is possible that many Streptococcal promoters are recognised in L. 
lactis. DNA fragments of different size ranges were purified from partial Tru9I digests 
of S. pneumoniae genomic DNA. As the Tru 91 restriction enzyme generates staggered 

20 ends the DNA fragments had to be made blunt ended before ligation to the EcoRV or 
Smal cut pTREPl-nuc vectors. This was achieved by the partial fill-in enzyme 
reaction using the 5-3' polymerase activity of Klenow enzyme. Briefly Tru9I digested 
DNA was dissolved in a solution (usually between 10-20 \xl in total) supplemented 
with T4 DNA ligase buffer (New England Biolabs; NEB) (IX) and 33 of each of 

25 the required dNTPs, in this case dATP and dTTP. Klenow enzyme was added (1 unit 
Klenow enzyme (NEB) per [ig of DNA) and the reaction incubated at 25°C for 15 
minutes. The reaction was stopped by incubating the mix at 75°C for 20 minutes. 
EcoRV or Smal digested pTREP-nuc plasmid DNA was then added (usually between 
200-400 ng). The mix was then supplemented with 400 units of T4 DNA ligase (NEB) 
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and T4 DNA ligase buffer (IX) and incubated overnight at 16°C. The ligation mix 
was precipitated directly in 100% Ethanol and 1/10 volume of 3M sodium acetate (pH 
5.2) and used to transform L, lactis MG1363 (Gasson, 1983). Alternatively, the gene 
cloning site of the pTREP-nuc vectors also contains a Bglll site which can be used to 
5 clone for example Sau3AI digested genomic DNA fragments. 

L. lactis transformant colonies were grown on brain heart infusion agar and nuclease 

secreting (Nuc + ) clones were detected by a toluidine blue-DNA-agar overlay (0.05 M 
Tris pH 9.0, 10 g of agar per litre, 10 g of NaCl per liter, 0.1 mM CaC12, 0.03% 
wt/vol. salmon sperm DNA and 90 mg of Toluidine blue O dye) essentially as 
10 described by Shortle, 1983, supra and Le Loir et al. y 1994, supra). The plates were 

then incubated at 37°C for up to 2 hours. Nuclease secreting clones develop an easily 

identifiable pink halo. Plasmid DNA was isolated from Nuc + recombinant L. lactis 
clones and DNA inserts were sequenced on one strand using the NucSeq sequencing 
primer described in Appendix 1, which sequences directly through the DNA insert. 

15 

Isolation of Genes Encoding Exported Proteins from 
S. pneumoniae 

A large number of gene sequences putatively encoding exported proteins in S. 
20 pneumoniae have been identified using the nuclease screening system. These have 
now been further analysed to remove artefacts. The sequences identified using the 
screening system have been analysed using a number of parameters. 

1. All putative surface proteins were analysed for leader/signal peptide 
25 sequences using the software programs Sequencher (Gene Codes Corporation) and 
DNA Strider (Marck, Nucleic Acids Res., 16:1829-1836 (1988)). Bacterial signal 
peptide sequences share a common design. They are characterised by a short 
positively charged N-terminus (N region) immediately preceding a stretch of 
hydrophobic residues (central portion-h region) followed by a more polar C-terminal 
30 portion which contains the cleavage site (c-region). Computer software is available 
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which allows hydropathy profiling of putative proteins and which can readily identify 
the very distinctive hydrophobic portion (h-region) typical of leader peptide 
sequences. In addition, the sequences were checked for the presence of or absence of a 
potential ribosomal binding site (Shine-Dai garno motif) required for translation 
5 initiation of the putative nuc reporter fusion protein. 

2. All putative surface protein sequences were also matched with all of the 
protein/DNA sequences using the publicly databases [OWL-proteins inclusive of 
SwissProt and GenBank translations]. This allows us to identify sequences similar to 
known genes or homologues of genes for which some function has been ascribed. 

10 Hence it has been possible to predict a function for some of the genes identified using 
the LEEP system and to unequivocally establish that the system can be used to 
identify and isolate gene sequences of surface associated proteins. We should also be 
able to confirm that these proteins are indeed surface related and not artifacts. The 
LEEP system has been used to identify novel gene targets for vaccine and therapy. 

15 3. Some of the genes identified proteins did not possess a typical leader peptide 

sequence and did not show homology with any DNA/protein sequences in the 
database. Indeed these proteins may indicate the primary advantage of our screening 
method, i.e. the isolation of atypical surface-related proteins, which may have been 
missed in all previously described screening protocols or approaches based on 

20 sequence homology searches. 

In all cases, only partial gene sequences were initially obtained. Full length genes were 
obtained in all cases by reference to the TIGR S. pneumoniae database 
(www@tigr.org) . Thus, by matching the originally obtained partial sequences with the 

25 database, we were able to identify the full length gene sequences. In this way, as 
described herein, three groups of genes were clearly identified, ie a group of genes 
encoding previously unidentified S. pneumoniae proteins, a second group exhibiting 
some homology with known proteins from a variety of sources and a third group 
which encoded known S.pneumoniae proteins, which were, however, not known as 

30 antigens. 
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Example 2: Vaccine trials 
pcDNA3.1+ as a DNA vaccine vector 

5 

pcDNA3.1+ 

The vector chosen for use as a DNA vaccine vector was pcDNA3.1 (Invitrogen) 
(actually pcDNA3.1+, the forward orientation was used in all cases but may be 

10 referred to as pcDNA3.1 here on). This vector has been widely and successfully 

employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et aL, Kurar and Splitter, Anderson et aL). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNA3.1 contains the ColEl origin of replication which allows 

15 convenient high-copy number replication and growth in E. coli. This in turn allows 
rapid and efficient cloning and testing of many genes. The pcDNA3.1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer which permits efficient, high-level expression of 

20 the recombinant protein. The CMV promoter is a strong viral promoter in a wide 

range of cell types including both muscle and immune (antigen presenting) cells. This 
is important for optimal immune response as it remains unknown as to which cells 
types are most important in generating a protective response in vivo. A T7 promoter 
upstream of the multiple cloning site affords efficient expression of the modified 

25 insert of interest and which allows in vitro transcription of a cloned gene in the sense 
orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G. and Brunham, R.C. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
30 immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L12 gene elicits immune response". Vaccine^ 15, 1851-57. 

35 

Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, G. 
(1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity ; 64, 3168-3173. 

40 Preparation of DNA vaccines 



Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing those 
sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNA3.1 expression vector. The signal peptide directs the polypeptide precursor 
to the cell membrane via the protein export pathway where it is normally cleaved off 
by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the signal peptide 
10 does not make up any part of the mature protein whether it be displayed on the surface 
of the bacteria surface or secreted. Where an N-terminal leader peptide sequence was 
not immediately obvious, primers were designed to target the whole of the gene 
sequence for cloning and ultimately, expression in pcDNA3.1. 

15 Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the design 
of oligonucleotides. These features included: 

20 1. LPXTG (SEQ ID NO: 182) cell wall anchoring motifs. 

2. LXXC (SEQ ID NO: 197) ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

4. Where no N-terminal signal peptide or LXXC (SEQ ID NO: 197) was present the 
start codon was excluded. 

25 5. Where no hydrophobic C-terminal domain or LPXTG (SEQ ID NO: 182) motif was 
present the stop codon was removed. 

Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 

30 these primers. The primers were designed with the appropriate enzyme restriction site 
followed by a conserved Kozak nucleotide sequence (in most cases(NB except in 
occasional instances for example ID59) GCCACC was used. The Kozak sequence 
facilitates the recognition of initiator sequences by eukaryotic ribosomes) and an ATG 
start codon upstream of the insert of the gene of interest. For example the forward 

35 primer using a BamHl site the primer would begin GCGGGATCCGCCACCATG 
(SEQ ID NO: 183) followed by a small section of the 5' end of the gene of interest. 
The reverse primer was designed to be compatible with the forward primer and with a 
Notl restriction site at the 5' end in most cases (this site is TTGCGGCCGC) (SEQ ID 
NO: 184) (NB except in occasional instances for example ID59 where a Xhol site was 

40 used instead of Notl). 

PCR primers 
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The following PCR primers were designed and used to amplify the truncated genes of 
interest. 

ID5 

5 

Forward Primer (SEQ ID NO: 185) 

5' CGGATCCGCCACCATGGGTCTAATTGAAGACTTAAAAAATCAA 3" 

Reverse Primer (SEQ ID NO: 186) 
10 5' TTGCGGCCGCCAATGCTAGACTAAACACAAGACTCA 3' 

ED59 

Forward Primer (SEQ ID NO: 187) 
15 5' CGCGGATCCATGAAAAAAATCTATTCATTTTTAGCA 3' 
Reverse Primer (SEQ ID NO: 188) 

5' CCCTCGAGGGCTACTTCCGATACATTTTAAACTGTAGG 3' 



20 ID51 

Forward Primer (SEQ ID NO: 189) 

5' CGGATCCGCCACCATGAGTCATGTCGCTGCAAATG 3' 
Reverse Primer (SEQ ED NO: 190) 
25 5' TTGCGGCCGCATACCAAACGCTGACATCTACG 3' 

BD29 

Forward Primer (SEQ ID NO: 191) 
30 5' CGGATCCGCCACCATGCAAAAAGAGCGGTATGGTTATG 3' 
Reverse Primer (SEQ ID NO: 192) 
5' TTGCGGCCGCACCCCCATTCTTAATCCCTT 3' 

ED50 

35 

Forward Primer (SEQ ID NO: 193) 

5* CGGATCCGCCACCATGGAGGTATGTGAAATGTCACGTAAA 3' 
Reverse Primer (SEQ ID NO: 194) 

5' TTGCGGCCGCTTTTACAAAGTCAAGCAAAGCC 3' 

40 

Cloning 



The insert along with the flanking features described above was amplified using PCR 
against a template of genomic DNA isolated from type 4 S. pneumoniae strain 1 1886 
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obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 
pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
5 scale (>1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 

maintenance of genes was confirmed by restriction mapping and sequencing -700 
base pairs through the 5' cloning junction of each large scale preparation of each 
construct. 

10 Strain validation 

A strain of type 4 was used in cloning and challenge methods which is the strain from 
which the 5. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 S. pneumoniae strain NCTC 11886 was 

15 obtained from the National Collection of Type Strains. The ampoule was opened and 
the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% blood). 
The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 5% 
blood) and incubated statically overnight at 37°C. This culture was streaked on to 5% 
blood agar plates to check for contaminants and confirm viability and on to blood agar 

20 slopes and the rest of the culture was used to make 20% glycerol stocks. The slopes 
were sent to the Public Health Laboratory Service where the type 4 serotype was 
confirmed. 

A glycerol stock of NCTC 1 1886 was streaked on a 5% blood agar plate and 
25 incubated overnight in a C02 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

30 A standard inoculum of type 4 5. pneumoniae was prepared and frozen down by 

passaging a culture of pneumococcus lx through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10 9 cfu/ml 
in broth before freezing down. The preparation is set out below as per the flow chart. 

35 Streak pneumococcal culture and confirm identity 

i 

V 

40 Grow over-night culture from 4-5 colonies on plate above 
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Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 



Grow over-night culture from animal passaged pneumococcus 



Grow day culture (to pre-determined optical density) from over-night of animal 
passage and freeze down at -70°C - This is standard minimum 

I 

V 

Thaw one aliquot of standard inoculum to viable count 

I 

V 

Use standard inoculum to determine effective dose (called Virulence Testing) 

I 

V 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10 5 cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

S. pneumoniae Vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six and 
each group was immunised with recombinant pcDNA3.1+ plasmid DNA containing a 
specific target-gene sequence of interest. A total of 100 \ig of DNA in Dulbecco's PBS 
(Sigma) was injected intramuscularly into the tibialis anterior muscle of both legs (50 
\i\ in each leg). A boost was carried using the same procedure 4 weeks later. For 
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comparison, control groups were included in all vaccine trials. These control groups 
were either unvaccinated animals or those administered with non-recombinant 
pcDNA3.1+ DNA (sham vaccinated) only, using the same time course described 
above. 3 weeks after the second immunisation, all mice groups were challenged intra- 
5 nasally with a lethal dose of S. pneumoniae serotype 4 (strain NCTC 1 1886). The 
number of bacteria administered was monitored by plating serial dilutions of the 
inoculum on 5% blood agar plates. A problem with intranasal immunisations is that in 
some mice the inoculum bubbles out of the nostrils, this has been noted in results table 
and taken account of in calculations. A less obvious problem is that a certain amount 

10 of the inoculum for each mouse may be swallowed. It is assumed that this amount will 
be the same for each mouse and will average out over the course of inoculations. 
However, the sample sizes that have been used are small and this problem may have 
significant effects in some experiments. All mice remaining after the challenge were 
killed 3 or 4 days after infection. During the infection process, challenged mice were 

15 monitored for the development of symptoms associated with the onset of S. 

pneumoniae induced-disease. Typical symptoms in an appropriate order included 
piloerection, an increasingly hunched posture, discharge from eyes, increased lethargy 
and reluctance to move. The latter symptoms usually coincided with the development 
of a moribund state at which stage the mice were culled to prevent further suffering. 

20 These mice were deemed to be very close to death, and the time of culling was used to 
determine a survival time for statistical analysis. Where mice were found dead, the 
survival time was taken as the last time point when the mouse was monitored alive. 

Interpretation of Results 

25 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 
challenge. Protection was taken as those DNA sequences that gave statistically 
significant protection (to a 95% confidence level (p<0.05)) and also those which were 

30 marginal or close to significant using Mann-Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 
to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 

35 intranasal infections. 
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Results 

Trials 1-6 (see figure 1) 



Mouse 
number 


Mean survival times (hours) 


Unvacc 
control (1) 


pcDNA 
3.1+ (1) 


ID5 (1) 


Unvacc 
control (2) 


ID59 (2) 


Unvacc 
control (5) 


1 


1 


47.5 


61.0 


61.0 


49.0 


55.0 


58.0 




2 


57.0 


47.5 


61.0 


51.0 


55.0 


75.0 




3 


47.5 


50.5 


57.0 


49.0 


55.0 


48.0 




4 


47.5 


50.5 


72.0 


55.0 


69.5 


46.7 




5 


77.0 


72.0 


47.5 


49.0 


74.0 


58.0 




6 


57.0 


50.5 


mouse died 


49.0 


mouse died 


75.0 




Mean 


55.6 


55.3 


59.7 


50.3 


61.7 


60.1 




sd 


11.5 


9.4 


8.8 


2.4 


9.3 


12.5 




p value 1 






0.1722 




0.0064 






p value 2 






0.2565 











- bubbled when dosed so may not have received full inoculum. 



5 T - terminated at end of experiment having no symptoms of infection. 

Numbers in brackets - survival times disregarded assuming incomplete dosing 

p value 1 refers to significance tests compared to unvaccinated controls 

p value 2 refers to significance tests compared to pcDNA3.1+ vaccinated controls 

10 Statistical Analyses. 

Trial 1 - None of the other groups had significantly longer survival times than the 
controls. The survival times of the unvaccinated and pcDNA3.1 control groups were 
not significantly different. One of the mice from IDS was an outlying result and the 
mean survival times for BD5 were extended but not significantly so. 

15 Trial 2 - The group vaccinated with ID59 had significantly longer survival times than 
the unvaccinated control group. 

Trial 5 - The group vaccinated with ID59 again survived for an average of almost 10 
hours longer than the controls but the results were not quite statistically significant. 
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Trial 6 - The group vaccinated with ID51 did not have survival times significantly 
higher than unvaccinated controls (p=<36.0), however, there were 2 outlying mice in 
the vaccinated group. 

Vaccine trials 7 and 8 (See figure 2) 



Mouse 
number 


Mean survival times (hours) 


Unvacc 
control (7) 


ID29 (7) 


Unvacc 
control (8) 


ID50 (8) 


1 


59.6 


73.1 


45.1 


60.6 


2 


47.2 


54.8 


50.8 


60.6 


3 


59.6 


59.3 


60.4 


51.1 


4 


70.9 


54.8* 


55.2 


60.6 


5 


68.6* 


59.3 


45.1 


60.6 


6 


76.0 


54.8 


45.1 


60.6 


Mean 


63.6 


59.35 


50.2 


59.1 


sd 


10.3 


7.1 


6.4 


3.9 


p value 1 




<39.0 




0.0048 



* - bubbled when dosed so may not have received full inoculum. 
T - terminated at end of experiment having no symptoms of infection. 
Numbers in brackets - survival times disregarded assuming incomplete dosing 
p value 1 refers to significance tests compared to unvaccinated controls 

Statistical Analyses. 

Trial 7 - The ID29 vaccinated group showed prolonged times to the first death. T 
Trial 8 - The group vaccinated with ID50 survived significantly longer than 
unvaccinated controls. 
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Appendix I - Oligonucleotide primers 

nucSl 

Bgl n Eco RV 

5- cgagatctgatatctcacaaacagataacggcgtaaatag -3 r (SEQ ID NO: 171) 
nucS2 

Bgl U Sma I 

5 - gaagatcttccccgggatcacaaacagataacggcgtaaatag -3' (SEQ ID NO: 172) 
nucS3 

Bgl H Eco RV 

5- cgagatctgatatccatcacaaacagataacggcgtaaatag -3' (SEQ ID NO: 173) 
nucR 

Bam HI 

5'- cgggatccttatggacctgaatcagcgttgtc -3' (SEQ ID NO: 174) 
NucSeq 

5 - ggatgctttgtttcaggtgtatc -3' (SEQ ID NO: 175) 
pTREPF 

5 - catgatatcggtacctcaagctcatatcattgtccggcaatggtgtgggctttttttgttttagcggataa 
caatttcacac -3' (SEQ ID NO: 176) 

pTREPR 

5 - gcggatcccccgggcttaattaatgtttaaacactagtcgaagatctcgcgaattctcctgtgtgaaatt 
gttatccgcta ~y (SEQ ID NO: 177) 

pUCF 

5- cgccagggttttcccagtcacgac -3* (SEQ ID NO: 178) 
VR 

5- tcaggggggcggagcctatg -3' (SEQ ID NO: 179) 
Vi 

5- tcgtatgttgtgtggaattgtg -3' (SEQ ED NO: 180) 
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tccggctcgtatgttgtgtggaattg -3' (SEQ ED NO: 181) 
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TABLE 1 



ID4 1200 bp 

(SEQ ID NO: 1) 

ATG AG A AAT ATGTGGGTTGT A ATC A AGG A A ACCT ATCTTCG AC ATGTCG AGTC ATGG AGTTTCTTCTTT ATGGTG AT 
TTCGCCGTTCCTCTTTTTAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGTTCTTCTATG 

TGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAAAG 

ACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATAGTG 

TTCTAAAGCKrAGTTTATCATGGCGAAACATCGCrrGAAAATGGAATTAAATTTGAGGTTACAGGTACACTCAATGA 

ACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCTCAAGAGCAGGAAAAACGCTTAGCGCAGACAATTCAA 

TTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTGCCTTAGGATTCT 

TTCTTTAT ATGATTC TGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAATTAT 

GGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCTCTGTTTCTAGTGATTTTAA 

cck:atattgggatctatgttgtaggtggtctggctgccgttttgctctttaaagatttgccattcttggct 

GTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCTTGCTCTTTATTTTGATCAGTCTTTTCATGTACGTAG 

TCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGCCTTGTCGCCTTTGATGATT^ 

ATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCTCTTGAAGATTGGTTCTTATA^ 

TCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACTATGCGGGGGGAGCAGAAGCATGGATTTC 

TTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCTCGTTCTTCAAACG 

G ATG ATTT AGGG ATTTGG A A A A CCTTT A A ACGTGCCTT ATCTT AT A A AT AG 

(SEQ ID NO: 2) 

MRNM\VVVIK£TYLRHVESWSFFFMVISPFLFLGISVGIGHLQGS^^ 

ASAK£AIK£EKLKGYLTIDQEDSVLKj\VYHGETSI^NGIKFEVTGTLNELQNQLNRSTASL^QEQEKR1^QTIQFTEKIDE 

akenkklfiqtiaagalgfflymtlityagvtaqevasekgtkrmevvfssirashyfyarmmalflvilthigiyvvggla 

avllfkdlpflaqsgildhlgdaisu^tllfilislfmyvvlaaflgsmvsrpedsgkalsplmilimggffgvtalgaag 

dnuj-kigsyipfistf™pfrtindyaggaeawislaitvifavvatgfigrmyaslvlqtddlgiwktfkralsyk 

IDS 1125 bp 

(SEQ ID NO: 3) 

cctgggaaagtcttgaaaattatgatagaatggtggaaggaaaaattcaggagagtagtagtgactcaaaatgtt 

gaaagtcttctcgtatccattgtaatcagtgcatacaatgaagaaaaatatctgcctggtctaattgaagacttaa 

aaaatcaaacctatcctaaagaggatattgaaattctatttataaatgctatgtccacagatgggaccacagctat 

cattc ack:a atttataaaggaagatacagagtttaactcaattagattgtataacaatcctaagaaaaatcaagct 

agtggttttaacctgggagttaaacattctgtaggggaccttattttaaaaattgatgctcattcaaaagttactga 

gacttttgtaatgaacaatgtggctattattcaacaaggtgaatttgtctgtggggggcctagaccgacgattgtc 

gaaggaaaaggaaaatgggcagagaccttck^atcttgttgaggaaaatatgtttggcagtagcattgccaattat 

cgaaatagttctgaggatagatatgtttcttctatttttcatggaatgtataaacgagaggttttccagaaggttgg 

tttagtaaatgagcaacttggccgaactgaagataatgatattcattatagaattcgagaatatggttataaaatc 

cgctatagcccaagtattctatcttatcagtatattcgaccaacattcaagaaaatgctgcatcaaaagtattcaa 

atgg tttgtggattggcttgacaagtcatgttcagtttaagtgtttatc 

ttttgagtcttgtgtttagtctack:attgttaccgatcac 

acttttgtcattactcactttgctgactttattaaaacataaaaatggatttctaattgtgatgcccttta 

ttccattcactttck:ttatggccttgggacgattgtaggtttaattagaggatttaaatggaaga 

agaacaataatttatttggataaaataagccaaataaatcaaaatatgctataa 

(SEQ ID NO: 4) 

PGKVLKIMIEWWK£KFRRVVVTQNVESLLVSIVISAYNEEKYLPGLIEDLKNQTYPKEDIEILFINAMSTDGTTAIIQQFIK 

EDTEFNSIRLYNNPKKNQASGFNLGVKHSVGDLILKIDAHSKVTETFVMNNVAIIQQGEFVCGGPRPTIVEGKGKWAETL 

HLVEENMFGSSIANYRNSSEDRYVSSIFHGMYFCREVFQKVGLVNEQLGRTEDNDIHYRIREYGYKIRYSPSILSYQYIRPT 

FKKMLHQKYSNGLWIGLTSHVQFKCl^LFHYVPCLFVLSLVFSLALLPITFVFrTLLLGAYFLLLSLLTLLTlXKHKNGFLI 

VMPFILFSIHFAYGLGTIVGLIRGFKWKKEYKRTHYLDKISQINQNML 

11)11696 bp 

(SEQ ID NO: 5) 

atgatgaaagaacaaaatacgatagaaatcgatgtatttcaattagttaaaagcttgtggaaacgcaagctaatg 

attttaatagtggcacttgtgacaggtgcgggggcttttgcatatagcacttttattgttaagccagaatatacgag 

taccacgcgaatttacgtagtgaatcck:aatcaaggagacaagccggggttgacaaatcaggatttgcaggcagg 

aacttatctggtaaaagactaccgtgagattatcctttcgcaggatgttttggaggaagttgtttct 

tagatttgacgccaaaaggtttggctaataaaattaaagtgacagtaccagttgatacccgtattgtctctatttca 

gttaatgatcgagttcctgaagaggcaagccgtatcgctaactctttgagagaagtagctgctcaaaaaattatca 
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GTATTACTCGTGTTTCTGACGTGACAACACTGGAGGAGGCAACKjCCCKjCGATATCCCCGTCTTCGCCAAATATTAA 

acgc a atac act a attgg' i" i" i " i" i"i ggc aggggtg attgg aact agtgtt atagttcttc atcttg a acttttgg at a 

ctcgtgtgaaacgtccggaagatatcgaaaatacattck:agatgacacttttgggagttgtck:caaacttggg^ 

gttgaaatag 

(SEQ ID NO: 6) 

MMKEQ^IEIDVFQLVKSLWKJ^KIJS/IILIVALVTGAGAFAYSTFIVK^ 

VKX)YREIIl^QDVLEEVVSDLKLDLTPKGLANKIKVTVPVDTRrVSISVNDRVPEEASRIANSLREVAAQKIISrmVSDVTT 
LEEARPAISPSSPNIKRNTLIGFLAGVIGTSVIVLHLELLDTRVKRPEDIENTLQMTLLGVVPNLGKLK 

ID19 555 bp 



(SEQ ID NO: 7) 

atggtaaaagtagcagttatattagctcagggctttgaagaaattgaagccnt 

gagccaatatcacatgtgatatggttggttttgaagagcaagtaacgggttcgcatgcaatccaagtaagagcag 

atcatgtctttgatggagatttatcagactatgatatgattgttcttcctggaggtatgcctggttctgcacatttac 

gtgataatcagaccttgattcaagaattgcaaack:ttcgagcaagaagggaagaaactagcack:catttgtgcgg 

caccaattgccctcaatcaagcagagatattgaaaaataagcgatacacttgttatgacggcgttcaagagcaaat 

ccttgatggtcactacgtcaaggaaacagtagtggtagatggtcagttgacaaccagtcggggtccttcaacagcc 

cttgcctttgcctacgagttggtggackzaactaggaggggacgcagagagtttacgaacaggaatgctctatcgag 

atgtctttggt a a a a atc agt a a 

(SEQ ID NO: 8) 

MVKVAVILAQGFEEIEALTVVDVLRRANITCDMVGFEEQVTGSHAIQVRADHVFDGDLSDYDMIVLPGGMPGSAHLRD 
NQTLIQELQSFEQEGKKLAAICAAPIALNQAEILKNKRYTCYDGVQEQILDGHYVKETVVVDGQLTTSRGPSTALAFAYE 
LVEQLGGDAESLRTGMLYRDVFGKNQ 

ID27 306 bp 

(SEQ ID NO: 9) 

gtggtagggatggtagaaccaaacctagaaagccttataaaagatctttacaatcatgctcgacatgatttgagtg 
aagatttagttgctgctctcctagagactactaaaaaactgcctactacaaatgagcaattgcaggcagttcgtct 
ctc aggcctggtc a atcgtg a attgctcct a a atccc a a ac atcc agc acctg agttgctc a acttggctcgctttg 
tcaaaagagaagaagccaagtacagaggaactgcgacttctgcck:ttatgtatgaggaactctttaaaatgctttg 

A 

(SEQ [D NO: 10) 

MVGMVEPNI^UKDLYNHARHDLSEDLVAALLETTKKIJ^TTNEQ 
EEAKYRGTATSALMYEELFKML 

ID29 945 bp 

(seq id no: ii) 

ttgttcttaaaaaaggaaagagaggtaatcagcatck:gtaaatggacaaaaggatttctcatctttggtgtggtga 

ctaccgttatcggctttatcctgctttttgtaggtatccaatctgacgggattaagagcctactttccatgtccaaag 

aacctgtctatgatagccgtacggaaaagctaacctttggcaaggaagtcgaaaacctagaaattactctccacca 

acacacgctcaccatcacagactctttcgatgatcaaatccacatttcttaccatccatctctttctgctcaccatg 

atcttatcaccaatcagaacgatagaactctgagtctcactgataagaaactgtctgaaactccgtttctctcttct 

ggaattggtckjgattcttcatatcgcaagtagctactctagtcgttttgaagaagttattctccgactaccaaaagg 

gagaactctaaaagggatcaacatctcagccaatcgcggacaaaccaccatcataaatgctagccttgaaaatgc 

gaccctcaatacaaacagctatatcctccgaattgaaggaagtcgtatcaaaaacagtaaactcacaacgcccaat 

atcgttaatatctttgatacagttcttacagatagtcagctagagtcaacagagaatcacttccacgctgaaaata 

tccaagtccatggcaaggttgaactgactgccaaagattatctcagaatcatcctagaccagaaagaaagccaac 

gaattaactgggacatctcaagcaactatggttctatcttccaattcacaagagaaaagcctgaatcaagaggtac 

ggaattaagcaacccttacaaaactgaaaaaaccgatgtcaaggatcaactcattgcgagatctgatgataatatt 

gatctaatatccacaccaagcagacgttga 

(SEQ ID NO: 12) 

MFLKKEREVISMRKWTKGFTJFGVVTT^IGFILLFVGIQSDG 

TDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRLPKGRTLKGINISANR 
GQTTIINASLENATLNTNSYILR IEGSRIKNSICLTTPNIVNIFDTVLTDSQLESTENHFHAENIQVHGKVELTAKDYLRIILD 
QKESQRINWDISSNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTPSRR 

ID30 879 bp 



(SEQ ID NO: 13) 
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ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCAA 

GAGGTTGCAGACAAGGCTGAAGAAACGATAGCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGGAG 

GAAGTCCCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAAGCG 

AGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGAAAA 

AGTCACTATAGCTGAAGAGAGCCAAGAAGCTCTTCCTCAGCAAAAAGCAACCACGAAAGAGCCACTTCTTATCAG 

TAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCTTGAT 

TTTTGGTCTTGGCTAGTGGAAGCGATCAAATCTCCTACAAGTAAGTTGGAAACAAGTATCACACACAGTTACACAG 

CCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTT 

TATAGCAAGCATTAA CAGTC GCTTCCCTGAGCAGCTAGCTCCTTTAACTCTTTTT^ 
GACAACACTCTTCTTCTTTTCATTCCTCTTGG 

CGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTGGCAATTCCAATCTC 
CTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA 

(SEQ ID NO: 14) 

MKQEWFESNDFVKTTSKNKPEEQAQEVADKA£ETIADLDTPIEK^TQL£EEVPQAEVELESQQEEKIEAPEDSEARTEIEE 
KKj\SNSTEEEPDLSKETEKVTL\EESQEALPQQKATTK£PmSKSLESPYIPDQAPKSRDKWK£QVLDFWSWLVEAIKSP 
TSKL£TSrTHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIHQ 
EKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCV 

ID105 990 bp 



(SEQ ID NO: 15) 

atgcaactcck:ttcttcggtctactcattgttcgtctggtacaatttgttcttaaaaaaggaaagagaggtaatcag 

catgcgtaaatggacaaaaggatttctcatctttggtgtggtgactaccgttatcggctttatcctgcnttttgtag 

gtatccaatctgacgggattaagagcctactttccatgtccaaagaacctgtctatgatagccgtacggaaaagct 

aacctttggcaaggaagtcgaaaacctagaaattactctccaccaacacacgctcaccatcacagactctttcgat 

gatcaaatccacatttcttaccatccatctctttctgctcaccatgatcttatcaccaatcagaacgatagaactct 

gagtctcactgataagaaactgtctgaaactccgtttctctcttctggaattggtgggattcttcatatcgcaagta 

gctactctagtcgttttgaagaagttattctccgactaccaaaagggagaactctaaaagggatcaacatctcagc 

caatcgcggacaaaccaccatcataaatgctagccttgaaaatgcgaccctcaatacaaacagctatatcctccga 

attgaaggaagtcgtatcaaaaacagtaaactcacaacgcccaatatcgttaatatctttgatacagttcttacag 

atagtcagctagagtcaacagagaatcacttccacgctgaaaatatccaagtccatggcaaggttgaactgactgc 

caaagattatctcagaatcatcctagaccagaaagaaagccaacgaattaactgggacatctcaagcaactatgg 

ttctatcttccaattcacaagagaaaagcctgaatcaagaggtacggaattaagcaacccttacaaaactgaaaa 

aaccgatgtcaaggatcaactcattgcgagatctgatgataatattgatctaatatccacaccaagcagacgttga 

(SEQ ID NO: 16) 

MQLASSVYSLFVWYNLFLKK£REVISMRK\\TTKGmFGVVTTVIGFILLFVGIQSDGIKSLLSMSKEPVYDSRTEKLTFGK 
EVENLEITLHQHTLTITDSFDDQIHISYHPSLSAHHDLITNQNDRTLSLTDKKLSETPFLSSGIGGILHIASSYSSRFEEVILRL 
PKGRTLKGINISANRGQTTIINASLENATLNTNSYILRIEGSRIKNSK1.TTPNIVNIFDTVLTDSQI^STENHFHAENIQVHG 
KVELTAKDYLRIILDQICESQRINWDISSNYGSIFQFTREKPESRGTELSNPYKTEKTDVKDQLIARSDDNIDLISTPSRR 

ID107-78bp 

(SEQ ID NO: 17) 

ATGATATGTAAAATGAAGCAGGGAGGGAGCAGGCK^GTGCTGGGGATGGAGAGTGGGGGAGGGACGCTGCTATTTT 
AATC 

(SEQ ID NO: 18) 

MICKMKQGGSRACWGWRVGEGRCYFN 
ID109 714 bp 



(SEQ ID NO: 19) 

cgataaagaggccttgagtaatctcaatttck:agattgaaaatggagagattatgggcttgattggtcataatggg 

gctggaaaatcgaccactataaaatccctagtcagtatcatttcacccagcagtggtcgtattttggtagacggtc 

aggagttatcggaaaatcgcttggctattaaacgaaagattggctacgtagcagactcgcctgacttatttttacg 

cttaacggccaatgaattttgggaattgatcgcctcatcctatgatctgagtagatctgacttggaggctagtctag 

ctaggctattgaacgtttttgattttgctgaaaatcgct^ 

aaagtctttgtcatcggagcactcttgtctgatcccgatatttgggttttggacgaacccttgactggtt^ 

ccagck:tgcctttgatttgaaacagatgatgaaggaacatck:acaaaaagggaagacagtcttgttttcaactcat 

gtcctagaggtggcagagcaagtctgtgatcggattgccattttgaaaaaggggcatttgatttattgtggtaagg 

tagaggacttgaggaaagaccacccagaccagtctttggaaagtatctaccttagtcttgctggtagaaaagagg 

aggttgcggatgcgtctcaaggtcattaa 



(SEQ ID NO: 20) 
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DK£ALSNLNLQIENGEIMGLIGHNGAGKSTT1KSLVSI!SPSSGR1LVDGQEI^ENRLAIKRKIGYVADSPDLFLRLTANEF 
\\nELIASSYDl^RSDl^ASI^RLUWroFAENRYQVEETl^HG 

MMKEHAQKGKTVLFSTHVIXVAEQVCDRIAILKKGHLIYCGKVEDLRKBHPIXJSl^ryLSl^GRK^EVADASQG^ 
ID112 360bp 

(SEQIDNO:21) 

atggcittgttttcagagagaggagcagtacggaagacaccaatggcaagtc^ 

cgacgatagagattaaaagagtgataccack:accacgcaagagttgttgccagttttcagaaagaattttagcaa 
cttggctaaagaaactactgctagtctcttcagttgttgtag<^ 

a aggc a acttggtc atcttttg a aatck3tttca atgctggc attg atttggcta atacgattgtc a i" i" it i acg a ag 
cccgatagcgatagctgtatcttcttccccagttttgaaaccaggttctacttga 

(SEQ ID NO: 22) 

MALFSERGAVRKTPMASPIMRPMMVFTEIKRVIPAPRKSCCQFS^ 
FEMVSMLALIWLIRLSFLRSPIAIAVSSSPVLKPGST 

ID 128 - 3.43 

(SEP ID NO: 23) 

atgaaatttagtaaaaaatatatagcagctggatcagctgttatcgtatc 

cttgagtctatgtgcctatgcactaaaccagcatcgttcgcaggaaaata 

aggacaataatcgtgtctcttatgtggatggcagccagtcaagtcagaaa 

agtgaaaacttgacaccagaccaggttagccagaaagaaggaattcaggc 

tgagcaaattgtaatcaaaattacagatcagggctatgtaacgtcacacg 

gtgaccactatcattactataatgggaaagttccttatgatgccctcttt 

agtgaagaactcttgatgaaggatccaaactatcaacttaaagacgctga 

tattgtcaatgaagtcaagggtggttatatcatcaaggtcgatggaaaat 

attatgtctacctgaaagatgcagctcatgctgataatgttcgaactaaa 

gatgaaatcaatcgtcaaaaacaagaacatgtcaaagataatgagaaggt 

taactctaatgttgctgtagcaaggtctcagggacgatatacgacaaatg 

atggttatgtctttaatccagctgatattatcgaagatacgggtaatgct 

tatatcgttcctcatggaggtcactatcactacattcccaaaagcgattt 

atctgctagtgaattagcagcagctaaagcacatctggctggaaaaaata 

tgcaaccgagtcagttaagctattcttcaacagctagtgacaataacacg 

caatctgtagcaaaaggatcaactagcaagccagcaaataaatctgaaaa 

tctccagagtcttttgaaggaactctatgattcacctagcgcccaacgtt 

acagtgaatcagatggcctggtctttgaccctgctaagattatcagtcgt 

acaccaaatggagttgcgattccgcatggcgaccattaccactttattcc 

ttacagcaagctttctgccttagaagaaaagattgccagaatggtgccta 

tcagtggaactggttctacagtttctacaaatgcaaaacctaatgaagta 

gtgtctagtctaggcagtctttcaagcaatccttcttctttaacgacaag 

taaggack:tctcttcagcatctgatggttatatttttaatccaaaagata 

tcgttgaagaaacggctacagcttatattgtaagacatggtgatcatttc 

cattacattccaaaatcaaatcaaattgggcaaccgactcttccaaacaa 

tagtctagcaacaccttctccatctcttccaatcaatccaggaacttcac 

atgagaaacatgaagaagatggatacggatttgatgctaatcgtattatc 

gctgaagatgaatcaggttttgtcatgagtcacggagaccacaatcatta 

tttcttcaagaaggacttgacagaagagcaaattaaggtgcgcaaaaaca 

TTTAG 

(SEQ rD NO: 24) 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQK 

SENLTPDQVSQKEGIQAEQIVIKITDQGYVTSHGDHYHYYNGKVPYDALF 

SEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTK 

DEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 

YIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNT 

QSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISR 

TPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEV 

VSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDHF 

HYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRII 

AEDESGFVMSHGDHNHYFFKKDLTEEQIKVRKNI* 
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TABLE 2 
ID2 840 bp 

(SEQ ID NO: 25) 

atgggaattgctctagaaaatgtgaattttacatatcaagaaggtactccct^ 

ttctttgacgattgaagatggctcttatacagctttaattgggcacacaggtagtggtaaatcaactattttacaac 
tcttaaatggtttattggtgccaagtcaagggagtgtgagggtttttgataccttaatcacctcgacttcta 
aaagatattcgtcaaattagaaaacaggttggcttckstatttcagt^ 
ttttgaaggacgttgcttttggaccgcaaaattttggagtttctgaagaagatc 

actggctctggttggaattgatgaatcactttttgatcgtagtccgtttgagctgtcagggggacaaatgagacgt 

gttck:cattgcaggcatacttgccatggagccack:tatattagtcttagatgagccaacagctggtcta 

agggagaaaagagttgatgaccctgttcaaaaaactccaccagtcagggatgaccatcgtcttggtaacgcatttg 

atggatgatgttgctgaatatgcgaatcaagtctatgtaatggaaaagggacgtttagtaaaggggggcaaacca 

agtgatgtcittcaagacgttgtttttatggaagaagttcagttgggagt^ 

attgck:tgatagaggcgtgtcatttaaacgattaccgattaagatagaggagttcaaggagtcgctaaatggatag 

(SEQ [D NO: 26) 

mgialenvnftyqegtplasaalsdvsltiedgsytalightgsgkstilqllngllvpsqgsvrvfdtlitstsknkdirq 
[rkqvglvfqfaenqifeetvlkdvafgpqnfgvseedavktareki-alvgideslfdrspfelsggqn1rrvaiagilam 
epailvldeptagldplgrkeijvitlfkklhqsgmtivlvthu^ddvaeyanqvyvmekgrlvkggkpsdvfqdvv™ 
eevqlgvpkitafcicrladrgvsfkrlpikieefkesjlng 

ID 3 6360 bp 
(SEQ ID NO: 27) 

tacccggtagtcttagcagacacatctagctctgaagatgctttaaacatctctgataaagaaaaagtagcagaaa 

ataaagagaaacatgaaaatatccatagtgctatggaaacttcacaggattttaaagagaagaaaacagcagtca 

ttaaggaaaaagaagttgttagtaaaaatcctgtgatagacaataacactagcaatgaagaagcaaaaatcaaag 

aagaaaattccaataaatcccaaggagattatacggactcatttgtgaataaaaacacagaaaatcccaaaaaag 

aagataaagttgtctatattgctgaatttaaagataaagaatctggagaaaaagcaatcaaggaactatccagtct 

taagaatacaaaagttttatatacttatgatagaatttttaacggtagtgccatagaaacaactccagataacttg 

gacaaaattaaacaaatagaaggtatttcatcggttgaaagggcacaaaaagtccaacccatgatgaatcatgcc 

agaaaggaaattggagttgaggaagctattgattacctaaagtctatcaatgctccgtttgggaaaaattttgatg 

gtagaggtatggtcatttcaaatatcgatactggaacagattatagacataaggctatgagaatcgatgatgatgc 

caaagcctcaatgagatttaaaaaagaagacttaaaaggcactgataaaaattattggttgagtgataaaatccc 

tcatgcgttcaattattataatggtggcaaaatcactgtagaaaaatatgatgatggaagggattattttgaccca 

catgggatck:atattgcagggattcttgctggaaatgatactgaacaagacatcaaaaactttaacggcatagatg 

gaattgcacctaatgcacaaattttctcttacaaaatgtattctgacgcaggatctgggtttgcgggtgatgaaac 

aatgtttcatgctattgaagattctatcaaacacaacgttgatgttgtttcggtatcatctggttttacaggaacag 

gtcttgtaggtgagaaatattggcaagctattcgggcattaagaaaagcaggcattccaatggttgtcgctacggg 

taactatgcgacttctck:ttcaagttcttcatgggatttagtagcaaataatcatctgaaaatgaccgacactgga 

aatgtaacacgaactgcagcacatgaagatgcgatagcggtcck:ttctgctaaaaatcaaacagttgagtttgata 

aagttaacataggtggagaaagttttaaatacagaaatataggggcctttttcgataagagtaaaatcacaacaa 

atgaagatggaacaaaagctcctagtaaattaaaatttgtatatataggcaaggggcaagaccaagatttgatag 

gtttggatcttaggggcaaaattgcagtaatggatagaatttatacaaaggatttaaaaaatgcttttaaaaaagc 

tatggataagggtgcacgcgccattatggttgtaaatactgtaaattactacaatagagataattggacagagctt 

ccagctatgggatatgaagcggatgaaggtactaaaagtcaagtgttttcaatttcaggagatgatggtgtaaagc 

tatggaacatgattaatcctgataaaaaaactgaagtcaaaagaaataataaagaagattttaaagataaattgg 

agcaatactatccaattgatatggaaagttttaattccaacaaaccgaatgtaggtgacgaaaaagagattgactt 

taagtttgcacctgacacagacaaagaactctataaagaagatatcatcgttccagcaggatctacatcttggggg 

ccaagaatagatttacttttaaaacccgatgtttcack:acctggtaaaaatat^^ 

tggcaaatcaacttatggctatatgtcaggaactagtatggcgactccaatcgtggcagcttctactgttttgatta 

gaccgaaattaaaggaaatgcttgaaagacctgtattgaaaaatcttaagggagatgacaaaatagatcttacaa 

gtcttacaaaaattgccctacaaaatactgcgcgacctatgatggatgcaacttcttggaaagaaaaaagtcaata 

ctttgcatcacctagacaacagggack:aggcctaattaatgtggccaatgctttgagaaatgaagttgtagcaact 

ttcaaaaacactgattctaaaggtttggtaaactcatatggttccatttctcttaaagaaataaaaggtgataaaa 

aatactttacaatcaagcttcacaatacatcaaacagacctttgacttttaaagtttcagcatcagcgataact 

gattctctaactgacagattaaaacttgatgaaacatataaagatgaaaaatctccagatggtaagcaaattgttc 

CAGAAATTCACCCAGAAAAAGTCAAAGGAGCAAATATCACATTTGAGCATGATACTTTCACTATAGGCGCAAATTC 

TAGCTTTGATTTGAATGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAATTTGTAGAATCATTTATTCAT 

TTTGAGTCAGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAACTTCCAACCTTCTTTGTCGATGC 

CTCT A ATGGG ATTTGCTGGG A ATTGG A ACC ACG A ACC A ATCCTTG AT A A ATGGGCTTGGG A AG AA GGGTC A AG ATC 

AAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCTTAAATAAGGGAATTGGTGGAGA 

ACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATAAAAATACAACATCCCTGGATCA 

AAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGCTCCATCATCAAGTGGTTCTAAGATTGCTAACATTT 
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atcctttagattcaaatggaaatcctcaagatck^tcaacttgaaagaggattaacaccttctccacttgtattaaga 
agtgcagaagaaggattgatttcaatagtaaatacaaataaagagggagaaaatcaaagagacttaaaagtcatt 
tcgagagaacactttattagaggaattttaaattctaaaagcaatcatc^^ 

aagtttggggtgacttgaagtgggatggactcatctataatcctagaggtagagaagaaaatgcaccagaaagta 
acksataatcaagatcctgctactaagataagagctcaatttgaaccgattg^^ 

t a a at at ag att a act a a ag att accc atggc a ggtttcct at attcctgt a a aa attg at a ac accgcccct a ag 

attgtttcggttg atttttc a a atcctg a a aa a att a agttg att ac a a agg at actt atc at a aggt a a a ag atc a 

gtataagaatgaaacgctatttgcgagagatcaaaaagaacatcctgaaaaatttgacgagattgcgaacgaagt 

ttggtatgctggcgccgctcttgttaatgaagatggagaggttgaaaaaaatcttgaagtaacttacgcaggtgag 

ggtcaaggaagaaatagaaaacttgataaagacggaaataccatttatgaaattaaaggtgcgggagatttaagg 

ggaaaaatcattgaagtcattgcattagatggttctagcaatttcacaaagattcatagaattaaatttg^ 

aggctgatgaaaagggcatgatttcctattatctagtagatcctgatcaagattcatctaaatatcaaaagcttgg 

cgagattgcagaatctaaatttaaaaatttaggaaatggaaaagagggtagtctaaaaaaagatacaactggggt 

ag a ac atc atc atc a ag a a a atg a ag agtct att a a ag a a a a atct agtttt act attg at ag a a at atttc a ac a 

attagagactttgaaaataaagacttaaagaaactcattaaaaagaaatttagagaagttgatgattttacaagtg 

aaactggtaagagaatggaggaatacgattataaatacgatgataaaggaaatataatagcctacgatgatggga 

ctgatctagaatatgaaactgagaaacttgacgaaatcaaatcaaaaatttatggtgttctaagtccgtctaaaga 

tggacactttgaaattcttggaaagataagtaatgtttctaaaaatgccaaggtatattatgggaataactataaa 

tctatagaaatcaaagcgaccaagtatgatttccactcaaaaacgatgacatttgatctatacgctaatattaatg 

atattgtggatggattagcttttgcaggagatatgagattatttgttaaagataatgatcagaaaaaagctgaaat 

t a a a att ag a atgcctg a a a a a att a agg a a act a a atc ag a at atccct atgt atc a agtt atggg a atgtc at a 

gaattaggggaaggagatctttcaaaaaacaaaccagacaatttaactaaaatggaatctggtaaaatctattct 

gattcagaaaaacaacaatatctgttaaaggataatatcattctaagaaaaggctatgcactaaaagtgactacct 

ataatcctggaaaaacggatatgttagaaggaaatgcagtctatagcaaggaagatatagcaaaaatacaaaag 

gccaatcctaatctaagagccctttcagaaacaacaatttatgctgatagtagaaatgttgaagatggaagaagta 

cccaatctgtattaatgtcggctttggacggctttaacattataaggtatcaagtgtttacatttaaaatgaacgat 

aaaggggaagctatcgataaagacggaaatcrtgtgacagattcttctaaacttgtattatttggtaaggatgata 

aagaatacactggagaggataagttcaatgtagaagctataaaagaagatggctccatgttatttattgataccaa 

accagtaaacctttcaatggataaga act actttaatccatctaaatctaataaaattt atgt acgaaatccagaa 

tttt attt a ag aggt a ag atttctg at a agggtggtttt a actggg a attg ag agtt a atg a atcggttgt ag at a 

attatttaatctacggagatttacacattgataacactagagattttaatattaagctgaatgttaaagacggtga 

catc atggactggggaatgaaagactataaagcaaacggatttccagataaggtaacag at atggatggaa atgt 

ttatcttcaaactggctatagcgatttgaatgctaaagcagttggagtccactatcagtttttatatgataatgtta 

aacccgaagtaaacattgatcctaagggaaatactagtatcgaatatgctgatggaaaatctgtagtctttaacat 

caatgataaaagaaataatggattcgatggtgagattcaagaacaacatatttatataaatggaaaagaatatac 

atcatttaatgatattaaacaaataatagacaagacactaaacattaagattgttgtaaaagattttgcaagaaat 

acaaccgtaaaagaattcattttaaataaagatacgggagaggtaagtgaattaaaacctcatagggtaactgtg 

accattcaaaatggaaaagaaatgagttcaacgatagtgtcggaagaagattttattttacctgtttataagggtg 

aattagaaaaaggataccaatttgatggttgggaaatttctggtttcgaaggtaaaaaagacgctggctatgttat 

taatctatcaaaagatacctttataaaacctgtattcaagaaaatagaggagaaaaaggaggaagaaaataaacc 

tacttttgatgtatcgaaaaagaaagataacccacaagtaaaccatagtcaattaaatgaaagtcacagaaaaga 

ggatttacaaagagaagagcattcacaaaaatctgattcaactaaggatgttacagctacagttcttgataaaaac 

aatatcagtagtaaatcaactactaacaatcctaataagttgccaaaaactggaacagcaagcggagcccagaca 

ctattagctgccggaataatgtttatagtaggaatttttcttggattgaagaaaaaaaatcaagattaa 

(SEQ ID NO: 28) 

YPVVLADTSSSEDA1J^ISDKEKVA£NKEK11ENIHSA 
KSQGDYTDSFVNKNTENPKKEDKVVYIAEFKDKESGEKAIKJE^ 

SVERAQKVQPMMNHARK£IGVEEAIDYLKSINAPFGKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDLK 

GTDKNYWLSDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDIKNFNGIDGIAPNAQIFSYKMYSD 

AGSGFAGDETMFHAIEDSIKHNVDVVSVSSGFTGTGLVGEKYWQAIRALRKAGIPMVVATGNYATSASSSSWDLVANN 

HLKJV1TDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNIGAFFDKSKITTNEDGTKAPSICLICFVYIGKGQD 

QDLIGLDLRGKIAVMDRIYTKDLKJSfAFKKAMDKGARACS4VVNTVNYYNRDNWTELPAMGYEADEGTKSQVFSISGDD 

GVKLWNMINPDKKTEVKRNNKEDFKDKI^QYYPIDMESFNSNKPNVGDEKEIDFKFAPDTDKELYKEDIIVPAGSTSWG 

PRIDLLLKPDVSAPGKNIKSTLNVINGKSTYGYMSGTSMATPIVAASTVL1RPKLKEMLERPVLKNLKGDDKIDLTSLTKI 

ALQNTARPMMDATSWKEKSQYFASPRQQGAGLINVANALRNEVVATFKNTDSKGLVNSYGSISLKEIKGDKKYFTIKL 

HNTSNRPLTFKVSASAITTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDUSIAVINVGE 

AKNKNKFVESFIHFESVEAMEALNSSGKKINFQPSL^MPLMGFAGNWNHEPILDKWAWEEGSRSKTLGGYDDDGKPKIP 

GTLNKGIGGEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSSSGSKIANIYPLDSNGNPQDAQLERGLTPS 

PLVLRSAEEGLISIVNTNKEGENQRDLKVISREHFtRGILNSKSNDAKGIKSSKLKVWGDLKWDGLIYNPRGREENAPESK 

DNQDPATKrRGQFEPIAEGQYFYKFKYRLTKDYPWQVSYrPVKIDNTAPKIVSVDFSNPEKIKLITKDTYHKVICDQYKNE 

TLFARDQKEHPEKFDEIANEVWYAGAALVNEDGEVEKNLEVTYAGEGQGRNRKLDKDGNTIYEIKGAGDLRGKIIEVIA 

LDGSSNFTKIHRIKFANQADEKGMISYYLVDPDQDSSKYQKLGEIAESKFKNLGNGKEGSLKKDTTGVEHHHQENEESIK 

EKSSFTIDRNISTIRDFENKDLKKLIKKKFREVDDFTSETGKRMEEYDYKYDDKGNIIAYDDGTDLEYETEKLDEIKSKIY 

GVLSPSKDGHFEILGKISNVSKNAKVYYGNNYKSLEIKATKYDFHSKTMTFDLYANINDIVDGLAFAGDMRLFVKDNDQ 

KKAEIKIRMPEKIKETKSEYPYVSSYGNV1ELGEGDLSKNKPDNLTKMESGKIYSDSEKQQYLLKDNIILRKGYALKVTT 
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YNPGKTDM LEGNGV YS KEDIAKIQKANPN Al^ETTl Y ADSRN VEDGRSTQS VUVIS ALDGFN 11R YQ VFTFKMNDKGE 
AIDKDGNLVTDSSKLVLFGKDDKEYTGEDKFNVEAIKEDGSMLFIDTKP^ 

SDKGGFNWELRVNESWDNYLIYGDLH fDNTKDFNIKlJ^VKiDGDIMDWGMKJDYKANGFPDKVTDMDGNVYLQTGYS 
DlJVlAKAVGVHYQFLYDNVKPEVNrDPKGNTSIEYADGKSVVFNUNlDKRNNGFDGEIQEQHIYINGK£YTSFNDIKQIIDK 
TLNIK[VVKDFARNTTVK£FILNKDTGEVSELKPHRVTVTIQNGK£MSSTrVSEEDFILPVYKGELEKGYQFDGWEISGFE 
GKKDAGYVINLSKDTFIKPVFKKIEEKKEEENKPTFDVSKKKDNPQVNHSQLNESHRKEDLQREEHSQKSDSTKDVTAT 
VLD KNN ISS KSTTNNPN KLPKTGTASG AQTLLA AG IM FIVGIFLGLKKKNQD 

ID6 597 bp 

(SEQ ID NO: 29) 

cttgaattaaataaaaaacgtcatgcgactaagcattttactgataagcttgttgatcccaaagatgtgcgtacgg 
ctatcgaaattgcaaccttack:gccaack:gcccacaacagccagccttggaaatttgtggtggtacgtgagaaaaa 
tgctgaactogcaaagttagcttatggttccaattttgaacagg 
cagatacggacttagccaaacgtckitcgtaagattgcccgtgttggtggtgc^ 

tcaatattttatgaaaaatctgccagctgagtttgcccgttacagtgagcaacaagtcagcgactacctagctctc 
aatgcaggtttggttck:catgaacttggttcttgcat^ 

tgacaaatcaaaagttaatgaagttttggaaatcgaagaccgtttccgcccagaactcttgatcacagtgggttat 
acagacgaaaaattggaaccaagctaccgcttgccagtagatgaaatcatcgagaaaagatag 

(SEQ ID NO: 30) 

L£LNKKJ*HATKJJFTDKJ.VDPKDVRTAIEIATl^PSAHNSQPWK 

DLAKRARKIARVGGANNFSEEQLQYFMKNLPAEFARYSEQQVSDYLALNAGLVAMNLVLALTDQG1GSNIILGFDKSK 
VNEVLEIEDRFRPELLITVGYTDEKLEPSYRLPVDEIIEKR 

IP7 1401 bp 

(SEQ ID NO: 31) 

ATGACAGCAATTGATTTTACAGCAGAAGTAGAAAAACGCAAAGAAGACCTCrTGG^ 

AAATCAATTCAGAACGTGATGACAGCAAGGCTGATCK^CCACK^ATCCATTTGGGCCTGGTCCAGTAAAAGCCTTGGA 

GAAATTCCTTGAAATCGCAGACCGCGATGGCTACCCAACTAAGAATGTTGATAACTATGCAGGACATTTTGAGTTT 

GGTGATGGAGAAGAAGTTCTCGGAATCTTTGCCCATATGGATGTGGTGCCTGCTGGTAGCGGTTGGGACACAGACC 

CTTACACACCAACTATCAAAGATGGTCGCCTTTATGCGCGCGGGGCTTCGGACGATAAGGGTCCTACAACAGCTTG 

TTACTATGGTTTGAAAATCATCAAAGAATTGGGTCTTCCAACTTCTAAGAAAGTTCGCTTCATCGTTGGAACAGACG 

AAGAATCAGGCTGGGCAGACATGGACTACTACTTTGAGCACGTAGGACTTGCCAAACCAGATTTCGGTTTCTCACC 

AGATGCTGAATTTCCAATCATCAATGGTGAAAAAGGAAATATCACGGAATACCTCCACTTTGCAGGAGAAAATAC 

AGGTGTTGCCCGTCTTCACAGCTTTACAGGTGGTTTACGTGAAAATATGGTACCAGAATCAGCAACAGCAGTCGTT 

TCAGGTGACTTGGCTGACTTGCAAGCTAAACTAGATGCCTTTGTTGCAGAACACAAACTTAGAGGAGAACTCCAAG 

AAGAACK:TCH3CAAATACAAGGTGACGATCATTGGTAAATCAGCCCACGGTGCTATGCCTGCTTCAGGTGTCAATGG 

CGCAACTTACCTTGCCCTCTTCCTCAGCCAGTTTGGCTTTGCTGGTCCAGCCAAAGACTACCTTGACATCGCAGGTA 

A A ATTCTCTTG A ACG ATC ATG AGGGTG A A A ATCTT A AG ATTGCTC ATGTGG ATG A A A AG ATGGGTGCTCTTTCT AT 

GAATGCCGGCGTCTTCCACTTCGATGAAACAAGTGCTGATAATACCATTGCCCTCAACATCCGCTATCCAAAAGGA 

ACAAGTCCAGAACAAATCAAGTCAATCCTTGAAAACTTGCCAGTTGTTTCTGTTAGCCTGTCTGAACACGGTCACA 

CGCCTCACTATGTGCCAATCKjAAGATCCACTTGTGCAAACCTTGTTGAATATCTATGAAAAACAAACTGGCTTTAA 

ACKjTCATGAACAAGTCATCGGTGGTGGAACCTTTGGTCGCTTGCTAGAACGCGGAGTTGCCTACGGTGCTATGTTC 

ccagactcgattgataccatgcaccaagccaatgaatttatcgccttggatgatcttttccgagcagcagcaattta 
tgccgaagctatttacgaattgatcaaataa 

(SEQ ID NO: 32) 

mta1dftaevekrkedlladlfslleinserddskadaqhpfgpgpvkalekfleiadrdgyptknvdnyaghfefgdg 

eevlgifahmdvvpagsgwdtdpytptikdgrlyargasddkgpttacyyglkiikelglptskkvrfivgtdeesgwa 

dmdyyfehvglakpdfgfspdaefpiingekgniteylhfagentgvarlhsftgglrenmvpesatavvsgdladlqa 

ki.dafvaehiclrgelqeeagkykvtiigksahgampasgvngatylalflsqfgfagpak:dyldiagkillndhegenl 

kiahvdekmgalsmnagvfhfdetsadntialnirypkgtspeqiksilenlpvvsvslsehghtphyvpmedplvqtll 

niyekqtgfkgheqvigggtfgrllergvaygamfpdsidtmhqanefialddlfraaaiyaeaiyelik 

ID8 1617 bp 

(SEQ ID NO: 33) 

GTGTATACTATTATAAAATCAAATATAAAAAAATTTAGTTTATTAACGATATTTATTGTTGCTGGTCAATTATTGCT 

AATTTATGCAGCAACTATTAATC^TCTGGTGTTGAATGAATTAATTGCGATGAATTTAGAGCGG IMUM'J'GAAATTGT 

CAATCTACCAAATGATTGTCTGGTGTGGGATAATATTCCTTGACTGGGTAGTGAAAAATTATCAGGTTGAAGTGAT 

CCAAGAGTTTAATCTAGAGATTCGAAATAGAGTTGCCACAGACATCTCTAACTCTACCTATCAAGAATTTCATAGT 

AAATCATCAGGAACATATCTTTCGTGGCTAAATAATGATGTTCAGACTTTAAATGATCAGGCGTTTAAACAACTTTT 

TTTAGTAATAAAAGGAATTTCTGGTACTATATTTGCAGTTGTGACTCTTAATCACTATCATTGGTCATTGACTGTAG 

CCACCTTGTTTTCATTAATGATTATGCTACTTGTACCAAAAATCTTTGCATCGAAAATGCGAGAAGTTAGTCTAAAT 
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ttaactaaccaaaatgaagcrrririaaaatctagtgagactatattgaatggatttgatgtgttagcgtccttgaa 
tcttttatatgtattgcctaagaaaattaaagaack:aggaattttattaaagatggttatacaa 
gtagaaacgttagcaggcgctattagcttctttctcaatai1 trrritcagatatctctcg 11111*1 iaacaggctat 
cttgcaataaaaggaatagtgaaaattggtactattgaagcaatagga 

t aggtg a att agg aggtc a att atcctct attattggt acg a agcct a ' a a a attgt attc a att a atcc a 

attgagtcaaataaaatgaatgatatcgaaccaaatgaggtgaatagagattttccgttatatgaagcaaaaaat 

atttgctataagtatggagataaagaaatattaaaaaacttaaatttttgttttcaacgtaatgaaaagtatttaat 

ttt aggtg a a agtgg aagcggg a a atct ac att att aa a att attg a atggctttttg ag ag att at agtgg ag a a 

ttck:gattctgcggggatgatataaaaaaaacctcctatttaaatatggtttcgaatgttctatatgtagatcaaaa 

agcttatttgtttgaaggtacgattagagataatattttattggaagaaaattatactgatgaagaaatact 

tctttagagcaagttggtttgagtgtaaaagattttcctaataacattttagattattatgttggt^ 

attactgtcaggagggcagaaacaaaaaattactttagctagagggctaattagaaataagaaaatagtattaat 

tgacgagggaacttctgctatcgataggagaacrrrcgttagcgattgaacgtaagatattagatagagaggatttg 

actgtcattattgttacccatgctccgcatccggaacttaaacaatattttactaagatatatcaatttccaaagga 

1 1 I I ATTTA A 
(SEQ ID NO: 34) 

MYTIIKSNIKKFSLLTIFIVAGQLXLIYAATn^ALVLNELIAMNI^RFLK^ 

IRNRVATDISNSTYQEmSKSSGTYl^WLNNDVQTLNDQAFKQLFLVIKGISGTIFAVNO'LNHYHWSLTVATLFSLMIML 

LVPKIFASKMREVSLNLTNQNEAFLKSSETILNGTOVI^SLNIlL^ 

FQISLVFLTGYLAIKGIVKIGTIEAICALTGVIFTAIXj^^^ 

KNICYKYGDKEILKNLNFCFQRNEKYLILGESGSGKSTLLKLLNGFLRDYSGELRFCGDDIKKTSYLNMVSNVLYVDQK 

AYLFEGTrRDNILl^ENYTDEEILQSLEQVGLSVKDFPNNILDYYVGDDGRLLSGGQKQKlTLARGLlRNKKIVLIDEGTSA 

IDRRTSLAIERKILDREDLTVIIVTHAPHPELKQYFTKIYQFPKDFI 

ID9 705 bp 

(SEQ ID NO: 35) 

ataacagttaaacagattatggacgaaatack;cgtttcagatatgactgcaaggcgctatttacaggaattagctg 
ataaagatttgctgattcgtgtgcatggtggagctgaaaaacttcgaaccaactcccttttgactaatgagcgatc 
aaatattgaaaaacaagccctccaaacgck:agaaaaacaagaaatagcccattttgcaggcagtctagtagaaga 
aagagaaactattttcattggaccaggaacaacattagagttttttc^ 

gtcgtaaccaacagtctacctgtttttctgattttaagcgaacgaaaattaacagatttgattttaata 
ttatcgcgatattacaggtgcttttgttckjtacattgaccct 

cgttagctgtaatggtattcaaaacggack:tctack:tacttttagcgaggaagagggagaggctcaacgcatcgct 
ttaaataattctaataaaaaatatttactcgcagatcatagcaagttcaataagtttgatttttatacttt^ 
tgtatcaaatcttgatactattgtttcagattctaaactaagtgattcaatcctttttaagctatctaaacacattaa 
agtcatcaaoccttaa 

(SEQ ID NO: 36) 

ITVKQIMDEIAVSDMTARRYLQELADKDLLIRVHGGAEICLRTNSLLTNERSNIEKQALQTAEKQEIAHFAGSLVEERETIF 
IGPGTTLEFFARELPIDNIRVVTNSLPVFLILSERKLTDLILIGGNYRDITGAFVGTLTLQNLSNLQFSKAFVSCNGIQNGAL 
ATFSEEEGEAQRIALNNSNKKYLLADHSKFNKroFYTFYNVSNLDTIVSDSKLSDSILFKLSKHIKVIKP 

ID10 483 bp 

(SEQ ID NO: 37) 

ATGACTGAGTTTTCGTTAGATCTTCTTCTAGAAGCCATTAAACTAGCTCGTTGGACCTACTACTATCACTTGAAACA 

GCTAGACAAAACAGATAAAGACCAAGAGCTTAAAACTGAAATTCAATCCATCTTTATCGAACACAAGGGAAATTA 

TGCTT ATCGCCGGGTTC ATTT A G A ACT A AG A A ATCGTGGTT ATCTGGT A A ATC AT A A A AG AGTTC A AGGCTTGaTG A 

AAGTACTCAATTTACAAGCTAAAATGCGAAAGAAACGAAAATATTCTTCTCATAAAGGAGACGTTGGTAAGAAGG 

CAGAGAATCTCATTCAAGCCCAATTTGAAGGCTCTAAAACAATGGAAAAGTGCTACACAGATGTGACTGAATTTGC 

CATTCCAGCAAGTACTCAAAAGCTTTACTTATCACCAGTTTTAGATGGCTTTAACAGCGAAATTATTGCTTTT^ 

TTTCTTGTTCGCCTAATTTAGAATAA 

(SEQ ID NO: 38) 

MTEFSLDLLLEAIKLARWTYYYHLKQLDKTDKDQELKTEIQSIFIEHKGNYAYRRVHLELRNRGYLVNHKRVQGLMKV 
LNLQAKMRKKRKYSSHKGDVGKKAENLIQAQreGSKTMEKCYTDVTEFAIPASTQKLYI^PVLDGFNSEIIAFNLSCSPN 
LE 

ID14 1266 bp 

(SEQ ID NO: 39) 

ccaggatttggtaccgttgcaagtggtgtck:ctttcctcctaaaggaaaatggaggaaaaatcaatcaatcagcac 

attcagatatcaaagttc^taaggtattggtcaaggatgaagatgaaaaaaatcgcttgcttgcagcagggaatg 

actttaactttgtaaccaatgtggatgatattttatcagaccaggatattactatcgtagtggaattgatggggcgt 
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attgagcctcctaaaacctttatcactcgt^ 
acctgtccatggcck:agaattgctagaaatcgct^ 

ggtgggattccaattcttcgtactttagcaaattccttggcttctgataaaattacgcgcgt 

cggaacitccaacttcatggtgaccaagatggtggaagaagck:tck3tcttacgatgatgctcttgcggaac^ 

cgtctaggatttgcagaaagcgatccgacgaatgacgtagatgggattgatgcagcctacaagatggttattttga 

ck:caatttgcctttggcatgaagattgcctttgatgatgtagcccacaagggaatccgcaatatcac 

cgtagctgtackitcaagagcttggttacgtagtgaaattggttggttctattgaggaaacttcttcagg 

cagaagtgactccaaccttcctacctaaagcgcacccacttgctagtgtgaatggcgtaatgaacgctgtctttgt 

agaatctatcggtattggtgagtctatgtactacggaccaggtgcgggtcaaaaaccaactgcaacaagtgttgta 

gctgatattgtccgtatcgttcgtcgtttgaatgatggtactattggcaaagacttcaacgaatatagccgtgactt 

ggtcttggcaaatcctgaagatgtcaaagcaaactactatttctcaatctt^ 

ttgaagttggctgaaatcttcaatgctcaagatatttcctttaagcaaatccttcaagatggcaaagagggtgaca 

aggcgcgtgtcgttatcatcacacacaagattaataaagcccagcttgaaaatgtctcagctgaattgaagaaggt 

ttcagaattcgacctcttgaataccttcaaggtgctaggagaataa 

(SEQ ID NO: 40) 

F<}FGTVASGVPFLLKENGGKINQSAHSDIKVAKVLVKDEDEKNRLLAAGNDFNFVTNVDDII^DQDITIVVEIJVIGRIEPA 
KTFITRALEAGKHVVTANKDLLAVHGAELLEIAQANKVALYYEAAVAGGIPILRTLANSLASDKITRVLGVVNGTSNFM 
VTKMVEEGWSYDDALAEAQRLGFAESDFTNDVDGIDAAYKJVIVIL^QFAFGMKrAFDDVAHKGIRNITPEDVAVAQELG 
YVVKJ.VGSIEETSSGIAAEVTPTFLPKAHPLASVNGVMNAVFVESIGIGESMYYGPGAGQKPTATSVVADIVRIVRRLND 
GTIGKDFNEYSRDLVLANPEDVKANYYFSILALDSKGQVLKLAEIFNAQDISFKQILQDGKEGDKARVVUTHKINKAQLE 
NVSAELKKVSEFDLLNTFKVLGE 

ID16 1725 bp 

(SEQ [D NO: 41) 

atgaaacacctattatcttacttcaaaccctacatcaaggaatcaattttagcccccttgttcaagctgttagaagc 

tgtttttcacctcttggttcccatggtgatt^ 

tctggatgcagattck3cctgctccttatcttt^ 

caaagck:agcagtaggttctgctaaggaattgacaaacgatctttatcgtcatattctttccttgcccaaggacag 
cagagaccgtctgacaacttctagtttggtcactcgcttgacttcggatacctaccagattcagactggtatcaatc 
aattcctgcgtctctttttacgagcgcccattatcgi'i ri "i'ggtgccati th atggcttatcgaatctcagctgagt 
tgactttctggttcttagtcttggttgccattttc 

acagtagtctcagaaagaaaacggaccaactggttcaggaaacgcgccagcaattgcaagggatgcgggttattc 

gtgcttttggtcaagaaaaacgagagttacagatttttcaaacccttaaccaagtttatgctagattacaagaaaa 

gacaggtttctggtctagtttattaacacctctgacctatctgattgtcaatggaactcttctcgttattatctggca 

aggctatatttcaattcaaggaggagtgctcagtcaaggtgctctcattgctcttatcaattacctcttacagattt 

tggtcgaattggtcaagctagccatgttgatcaattccctcaaccagtcctatatctcagtcaagcgaatcgagga 

agtctttgttgaggctccagaggatatccattcagagttagaacaaaagcaagctaccagagataaggttttacaa 

gtccaagaattgacctttacctatcctgatgcggcccagccttctctgagatacatttcctttgatatgactcaagg 

ac a a attct aggt atc atcggggg a actggttctggt a a atc a agcttggtgc a actctt acttgg acttt atcc ag 

tagacaaggggaacattgacctttatcaaaatggacgtagtcctcttaatttggagcagtggcggtcttggattgc 

ctatgtacctcaaaaggtcgaactctttaaaggaaccattcgttccaacttgactctaggtttcaatcaagaagtat 

ctgaccaggaactctggcagck:cttggagattgcck:aagctaaggattttgtcagtgaaaaggaaggactcttgg 

atgctctagttgaggcaggggggcgaaatttctcaggtggacaaaaacaaagattgtctatcgcccgagcagtctt 

gcgccaggctccgtttctcatcctagatgatgcaacctcggcactggataccattacagagtccaagctcttgaaa 

gctattagagaaaattttccaaacacgagcttaattttgatctctcaacgaacctcaactttacagatggcggacc 

agattctcctcttggaaaaaggtgagttgctagctgttggcaagcacgatgacttgatgaaatccagccaagtcta 

ttgtgaaatcaatgcatcccaacatggaaaggaggactag 

(SEQ ID NO: 42) 

MKHLl^YFKPYIKESILAPLFKLLEAVFELLVPMVIAGIVDQSLPQGDQGHLWMQIGLLLIFAVIGVLVALIAQFYSAKAA 

VGSAK^LTNDLYRHILSLPKDSRDRLTTSSLVTRLTSDTYQIQTGINQFLRLFLRAPIIVFGAIFMAYRISAELTFWFLVLVA 

ILTIVIVGI^RLVNPFYSSLRKKTDQLVQETRQQLQGMRVIRAFGQEKRELQIFQTLNQVYARLQEKTGFWSSLLTPLTYL 

IVNGTLLVIIWQGYISIQGGVLSQGALIALINYI^QILVELVKIAMLINSLNQSYISVKRIEEVFVEAPEDIHSELEQKQATR 

DKVLQVQELTFTYPDAAQPSLRYISFDMTQGQILGIIGGTGSGKSSLVQLLLGLYPVDKGNIDLYQNGRSPLNLEQWRSW 

IAYVPQKVELFKGTIRSNLTLGFNQEVSDQELWQALEIAQAKDFVSEKEGLLDALVEAGGRNFSGGQKQRLSIARAVLR 

QAPFLILDDATSALDTITESKLLKAIRENFPNTSLILISQRTSTLQMADQtLLLEKGELLAVGKHDDLM KSSQVYCEINASQ 

HGKED 

ID18 1224 bp 

(SEQ ID NO: 43) 

atg a a acgttctctcg actc a ag agtcg attac agtttgctcttgcc agt a 111111 ctactggtc atcggtgtggt 
ggctatctatatagccgttagtcatgattatcccaataatattctgcccattttagggcagcaggtcgcctggattg 
ccttggggcitgtgattggttttgtggtcatcotctt^^ 
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TTTTACXXnTGGGACTTATGATCTTGCCGATTC 

TGCKjTATCAATAAATCKjAATTACCCTATTCCAACCGTCAGAATTTATGAAGATATCCTATATCCTCATGTTGGCTC 
TGTCATTGTCCAATTTACAAAGAAACATAAGGAATGGAGACGCACGGTTCCGCTGGAC1 I I I 1 GTTA ATTTTCTGGA 
TGATTCTCTTTACCATTCCAGTCCTAGTTCTT^ 

TTTTCTCAGGAATCGTTTTATTATCAGGGGTTTCTTGGAAAATTATTATCCCAGTATTTGTGACT 

GTTGCTGGTTTCTTAGCTATCTTTATTAGCAAGGACGGACGAGCTTTTCnTCACCAGATTGGAATGCCGA 

A ATT A ATCGG ATTTTGGCTTGGCTC A ATCCCTTTG A GTTTGCCC A A AC A ACG ACTT ACC AGC AGGCTC A AGGGC AG 

ATTGCC ATTGGG A GTGGTGGCTT ATTTGGTC AGGG ATTT A ATGCTTCG A ATCTGCTT ATCCC AGTTCG AG AGTC AG A 

TATGATTTTTACGGTTATTGCAGAAGATTTTGGCTTTATTGGCTCTGTCCTGGTTATTGCC 

TTACCGTATGTTGAAGATTACTCrTAAATCAAATAACCAGTTCTACACTTATATrrCCACAGGTTTGATTATGATGTT 
GCTCTTCCACATCTTTGAGAATATCGGTGCTGTGACTGGACTACTTCCTTTGACGGGGATTCCCTTGCCTTT 
GCAAGGGGGATCAGCTATTATCAGTAATCTGATTGGTGTTGGTTTGCTTTTATCGATGAGTTACCAGACT 
CTGAAGAAAAGAGCGGAAAAGTCCCATTCAAACGGAAAAAGGTTGTATTAAAACAAATTAAATAA 

(SEQ ID NO: 44) 

MKJ^SU)SRVDYSLLLPVFFLLVIGVVAIYIAVSHDYPNNILPILGQQVAW1ALGLVIGFVVMLFNTEFLWKVTPE^YILGL 
GLMILPIVFYNPSLVASTGAK^WVSINGITLFQPSEFMKISYILMLARVrV^ 
VLl^LQSDLGTALVFVAIFSGIVLLSGVSWKIUPVFVTAVTGVAGFLAIFISK^^ 
QTTTYQQAQGQIAIGSGGLFGQGFNASNLLIPVRESDMIFTVIAEDFGFIGSVLVI^^ 

TGLIMMLLFHIFENIGAVTGLLPLTGIPLPFISQGGSAIISNLIGVGLLLSMSYQTNLAEEKSGKVPFKRKKVVLKQIK 
ID22 987 bp 

(SEQ ID NO: 45) 

ATGGTGGCTAAGAAAAAAATCTTATTTTTTATGTGGTCTTTrrCTCrrGGAGGTGGTGCAG 

CATTGTTTCAAATCTGGATCCAGAAAAGTATGATATTGATATTCTTGAAATGGAGCACTTTGACAAGGGATATGAA 

TCTGTTCCA AAGC ATGTACGC ATTTTA AAATCCCTTC A AG ATTATCGCCA AACC AG ATGGTTACGAGC I " I 1 ITI'GTG 

G AG AATG AG A ATTT ATTTTCC A AG ACTG ACTCGTCGTTTGCTTGT A A A AG ATG ATT ATG ATGTTG A AGTTTC II " II A 

CCATTATGAATCCACCACTGTTGTTCTCTAAAAGAAGAGAAGTCAAGAAGATATCTTGGATTCATGGAAGTATTGA 

AGAACTTCTTAAGGATAGCTCTAAAAGAGAATCACATAGAAGCCAGTTGGATGCTGCGAATACAATTGTAGGGATT 

TCAAAAAAGACCAGCAATTCTATCAAGGAAGTTTATCCAGATTATACTTCTAAATTACAGACAATCTACAATGGAT 

ATGATTTTCAGACTATTCTAGAAAAATCTCAAGAGAAGATCGATATCGAGATTGCTCCTCAAAGTATCTGTACTATC 

GGACGGATTGAGGAAAATAAGGGTTCTGACCGTGTAGTGGAAGTGATACGATTATTACACCAAGAGGGAAAAAAC 

TATCATCTCTATTTTATCGGGGCTGGTGATATGGAAGAGGAACTGAAAAAACGAGTCAAAGAGTATGGGATTGAG 

GACTATGTACATTTCCTTGGTTATCAAAAAAATCCTTATCAGTATCTATCTCAGACGAAAGTTCTTTTGTCTATGTCT 

AAACAAGAAGGTTTTCCTGGAGTGTATGTGGAGGCCTTGAGTCTGGGACTCCCTTTTATCTCTACGGACGTTGGAG 

GGGCTGAGGAATTATCCCAAGAAGGACGATTTGGACAAATCATTGAGAGCAATCAAGAGGCAGCTCAGGCGATTA 

CTAATTACATGACTTCTGCCTCAAACTTTGATGTCGATGAGGCTAGCCAATTCATTCAACAATTTACAATTACAAAA 

CAAATCGAACAAGTAGAAAAACTATTAGAGGAGTAG 

(SEQ ID NO: 46) 

mvakkkilf™wsfslgggaekilstivsnldpekydidilemehfdkgyesvpkhvrilkslqdyrqtrwlraflwrm 

riyfprltrrllvkddydvevsftimnppllfskrrevkkiswihgsieellkdsskreshrsqldaantivgiskktsnsik 

evypdytsklqtiyngydfqtileksqekidieiapqsictigrieenkgsdrvvevirllhqegknyhlyfigagdmeeel 

kkj^vkeygiedyvhflgyqkjvlpyqyl^qtkvllsmskqegfpgvyvealslglpfistdvggaeelsqegrfgqilesnq 

eaaqaitnymtsasnfdvdeasqfiqqftitkqieqvekllee 

ID23 1434 bp 

(SEQ ID NO: 47) 

ATGGAAACTGCATTAATTAGTGTGATTGTGCCAGTCTATAATGTGGCGCAGTACCTAGAAAAATCGATAGCTTCCA 

TTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGGTCGCTTGTG 

TGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATTGTCGCAAGCACGAAA 

TGATGGGATG AAGC AGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGATT AT ATCCATCCAGA AATG ATTC 

AGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAGCTGTGGTGTCATGAATGTCTATGCTAATGA 

TGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGTCTGTGATTCTCAAACATTTCTAAAGGAATACCTCATA 

GGTG A A A A A AT ACCTGGG ACG ATTTGC A AT A A GCT A ATC A AG AG AC AG ATTGC A ACTGCCCT ATCCTTTCCTA AGG 

GGTTGATTTACGAAGATGCCTATTACCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAACCC 

TATTATTACTATTTCCATAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTGATAT 

CTACCAAAAGTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGCCTATG 

CCCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGATTCATCGT 

TTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGA 

ATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATCTAAAAAATTACATTAG 



(SEQ ID NO: 48) 
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METALISVIVPVYNVAQYLEKSIASIQKQTYQNLEIILVDDGATDESGRLCDSIAEQDDRVSVLHKKNEGLSQARNDGMK 

QAHGDYLIFIDSDDYIHPEM IQSLYEQLVQEDADVSSCGVMNVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPGTI 

CNKJ.IKJ^QIATAl^FPKGLIYEDAYYHroLIKLAKKYVVNrTKPYYYYFHRGDSnTKPYAEKDLAYIDIYQKFYNEVV^ 

YPDLKEVAFFRlJVYAHFFUX>KMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIF^ 

KSKKLH 



ID24 735bp 
(SEQ ID NO: 49) 

ATGAGAATCAAAGAGAAAACCAATAATATTAATGGAGGAATAAAAAATGTAAGTAAGCATTATGGTCATTCAATC 

ATTCTCAAAGATATAAATTTTGCACTTAACAAGGGTGAAATTGTTGGTCTAGCAGGGAGAAATGGAGTTGGTAAGA 

GTACGTTGATGAAAATTCTTGTTCAGAATAATCAACCGACTTCAGGTAATATTATAAGCAGTGATAATGTTGCGTA 

TTTAATCGAAGAACCAAAATTATTTTTATCTAAAACAGGTTTAGAGAATTTAAAATATTTGTCAAAT^ 

TTGACTACAATCAAGAAAGATTTAGATGTTTGATCCAAGAGTTAGATTTGACTCAGTCTATTAATAAAAAAGTAAA 

GACCTATTCTITGGGTACAAAACAAAAATTACXriTTGCTTCT^ 

ATGAACCGACTAATGGTTTAGATATTGAATCATCACAAATAGTTTTAGCGGTTCTAAAAAAATTAGCTTTACATGA 
AAATGTGGGAATTTTAATATCGAGTCATAAATTAGAAGACATTGAAGAAATTTGTGAGAGAGTTCTTTTCTTGGAG 
AACGGGCTTTTGACATTTCAAAAAGTAGGAAAAGATAGTCATAATTTCTTGTTTGAGATAGCTTT^ 
AGATAGAGACATTTTCATTACCAAACAAGAATTTTGGGATATTGTTTAG 

(SEQ ID NO: 50) 

MRIKEKTNNINGGIKNVSKHYGHSIILKDINFALNKGEIVGLAGRNGVGKSTLMKILVQNNQPTSGNIISSDNVGYLIEEP 

KLFLSKTGLENLKYLSNLYGVDYNQERFRCLIQELDLTQSINKKVKTYSLGTKQKLALLLTLVTEPDILILDEPTNGLDIES 

SQIVLAVLKKLALHENVGILISSHK1.EDIEEICERVLFLENGLLTFQKVGICDSHNFLFEIAFSSATDRDIFITKQEFWDIV 

ID25 1704bp 

(SEQ ID NO: 51) 

atgactgaattagataaacgtcaccck:agtagcatttatgacagcatggttaaatcacctaaccgtgctatgcttc 

gtgcgactggtatgacagataaggactttgaaacatcgattgtgggagtgatttcgacttgggcggaaaatacacc 

atgtaacattcacttck:atgatttcgggaaactggctaaagaaggtgtcaaatctck:agck:gcttggcctgtacag 

tttggaaccattaccgtagcggacgggatcgctatgggaacgcctggtatgcgtttctctctaacatctcgtgacat 

catcgcggactccatcgaggcggctatgagtggtcacaacgtggatgccttcgtcgctatcggtggctgtgacaag 

a ac atgcctgg atct atg attg ct attgct a at atgg at atccc agct attttcgcct atggtgg a act attgc acc 

gggaaatcttgatggtaaagatatcgacttggtttctgtctttgaaggtatcggaaaatggaaccacggtgacatg 

ac agctg agg acgtg a a acgtcttg a atgt a atgcctgccctggccctggtggttgtggtggt atgt at actgct a 

ataccatggcaactck:tatcgaagttctagggatgagtttgccagggtcatcctctcacccagctgaatcagctga 

taagaaagaagatatcgaagcagcaggacgtgctgttgttaagatgttggaacttggtctcaaaccatcagatatc 

ttgactcgtgaagcctttgaagatgctatcactgtaacgatggctctcggtggttctacaaacgccactcttcactt 

gctcgccattgcccatgccgcaaatgttgacttgtcacttgaggacttcaatacgattcaagaacgtgtgcctcact 

tggccgacttgaaaccatctggtcagtatgtcttccaagacctctacgaagtcggtggtgtccctgcggttatgaa 

gt atttgttggc a a atggtttccttc acgg ag atcgc atc ac atgt actggt a ag actgt agctg a a a acttggctg 

actttgcagacttgactccaggccaaaaagttatcatgccacttgaaaatccaaaacgtgcggatggtccgcttat 

catcttgaacgggaaccttgctcctgacggtgcagttgccaaggtatcaggtgttaaagtgcgtcgtcacgttggg 

ccagctaaggtctttgactcagaagaagatgcgattcaggccgttctgacagatgaaatcgttgatggcgatgtag 

tcgttgttcgttttgttggacctaaaggtggtcctggtatgcctgagatgctatcactttcttcaatgattgttggta 

a aggtc aggg ag at a aggtgg ccctcttg acgg ac gg acgtttctctggtggt actt atggtctg gttgttgg ac a 

tatcgctcctgaagctcaggatggtggaccaattgcctatctccgtaccggcgatatcgttacggttgaccaagat 

accaaagaaatttctatggccgtatccgaagaagaacttgaaaaacgcaaggcagaaacaaccttgccaccactt 

t ac agccgtggtgtcctc ggt a a at atgccc ac atcgt atc atctgcttc acgcgg agccgtg ac ag acttctgg a 

atatggacaagtcaggtaaaaaataa 

(SEQ ID NO: 52) 

M TELD KR HRSSIYDSMVKSPNRAM LR ATG M TD KD FETS I VG V I ST W AENTPC N IH LH D FG KL A KEG V KS AG AWP VQ FG 

TITVADGIAMGTPGMRFSLTSRDIIADSIEAAMSGHNVDAFVAIGGCDKNMPGSMIAIANMDIPAIFAYGGTIAPGNLDGK 

DIDLVSVFEGIGKWNHGDMTAEDVKRLECNACPGPGGCGGMYTANTMATAIEVLGMSLPGSSSHPAESADKKEDIEAA 

GRAVVF^l^LGLKPSDILTREAFEDAITVTMALGGSTNATLHLLAIAHAANVDLSLEDFNTIQERVPHLADLKPSGQYVF 

QDLYEVGGVPAVMKYLLANGFLHGDRITCTGKTVAENLADFADLTPGQKVIMPLENPKRADGPLIILNGNLAPDGAVA 

KVSGVKVRRHVGPAKVFDSEEDAIQAVLTDEIVDGDVVVVRFVGPKGGPGMPEMLSLSSMIVGKGQGDKVALLTDGR 

FSGGTYGLVVGHIAPEAQDGGPIAYU^TGDIVTVIXJDTICEISMAVSEEELEKRKAETTLPPLYSRGVLGKYAHIVSSASR 

GAVTDFWNMDKSGKK 

IP26 274bp 



(SEQ ID NO: 53) 
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ATGTTATAATAAAAATAAAGAATTTAAGGAGAAATACAATATGTCAATTTTTATTGGAGGAGCATGGCCATATGCA 
AACCK^TTCGTTACATATTGGTCACGCGGCAGCGCTTT^ 

GAGAGGAAGTTTTATATGTTTCTGGAAGTGATTGTAATGGAACCCCTATTTCTATCAGAGCTAAAAAAGAAAATAA 
GTCTGTG A A AG A A ATTGCTG A 11 I" II ATC AT A AGG A ATTTA ATCC A 

(SEQ ID NO: 54) 

CYNKNKEFKEKYNMSIFIGGAWPYANGSLHIGHAAALLPGDILARYYRQKGEEVLYVSGSDCNGTPISIRAKKENKSVK 
EIADFYHKEFNP 

1D28 1065bp 

(SEQ ID NO: 55) 

atgacaacattattttcaaaaattaaagaagtaacagaacttgctgcagtctcaggtcatgaagcgcctgtccgtg 

cttatcttcgtgaaaagttgacaccgcatgtggatgaagtggtgacagatggcttgggtggtatttttggtatcaa 

acattcagaagctgtggatck:accgcck:gtcttggtcgcttctcatatggacgaagttggttttatggtcagcgaa 

atcaagccagatggtaccttccgtgtcgtagaaatcggtggctggaaccccatggtggttagcagccaacgtttca 

aactcttgactcgtgatggtcatgaaattcctgtgatttcaggttctgttcctccgcatttgactcgtggaaagggg 

ggaccaaccatc^cac^cattgccgatatcgtttttgatggtggttttgcggacaaggctgaggcagaaag 

gcatccgtcctggtgataccattgtaccagatagttctgcaattttgacagccaatgaaaaaaatatcatctcaaa 

agcttgggataaccgctacggtgtcctcatggtaagcgagctagctgaagctttatcgggtcaaaaactcggcaat 

gaactctatctgggttctaacgtccaagaagaagttggtctgcgtggcgctcatacctctacaaccaagtttgacc 

cagaagtcttcctcgcagttgattgctcaccagcaggtgatgtctacggtggtcaaggcaagattggagatggaac 

cttgattcgtttctatgatccaggtcacttgcttctcccagggatgaaggatttccttttgacaacggctgaagaag 

ctggtatcaagtaccaatactactgtggtaaaggcggaacagatgcaggtgcagctcatctgaaaaatggtggtgt 

cccatcaacaactatcggtgtctgcgctcgttatatccattctcaccaaaccctctatgcaatggatgacttcctag 

aagcgcaagctttcttacaagccttck3tgaagaaattggatcgttc^ 

(SEQ ID NO: 56) 

MTTLFSKIK£VTELAAVSGHEAPVRAYLREKLTPHVDEVVTDGLGGIFGIKHSEAVDAPRVLVASHMDEVGFMVSEIKP 
DGTFRVVEIGGWNPMVVSSQRFKLLTRDGHEIPVISGSVPPHLTRGKGGFO^PAIADIVFDGGFADKAEAESFGIRPGDTI 
VPDSSAILTANEKJsIIISKAWDNRYGVLMVSELAEAl^GQKl-GNELYLGSNVQEEVGLRGAHTSTTKroPEVFIJ^VDCSP 
AGDVYGGQGKIGDGTLIRFYDPGHLLLPGMKDFLLTTAEEAGIKYQYYCGKGGTDAGAAHLKNGGVPSTTIGVCARYI 
HSHQTLYAMDDFLEAQAFLQALVKKLDRSTVDLIKHY 

ID31 1182bp 

(SEQ ID NO: 57) 

ATGG A ATTTTCT ATG A A ATC AGTC A A AGG A CT ACTCTTT ATC AT AGCT AGTTTT ATCTTG AC TCTTTTG ACTTGG ATG 

aacacttctccccaattcatgattccaggactagctttaacaagcctatctctgacttttatcctagccactcgtctc 
ccactactagaaagctggtttcacagtttggagaaggtctacaccgtccacaaattcacagcctttctctcaatcat 
cctactaatctttcataactttagtatgggcggtttgtggggctctcgcttagctgctcagtttggcaatcttgccat 
ctatatctttgccagcatcatccttgtcgcctatttaggcaaatacatccaatacgaagcttggcgatggattcacc 
gcctggtttacctagcctatattttaggactctttcacatctacatgataatgggcaatcgtctccttacatttaatc 
ttctaagttttcttgttggtack:tatgccctt^ 

ttccttcccctatctagggaaaattacccatctcaaacgcttaaatcacgatactagagaaattcaaatccatctta 

gcagacctttcaactatcaatcaggacaaritgcctttctaaagattttccaagaaggctttgaaagtgct 

cccttttctatctcaggaggtcatggtcaaactctttactttactgttaaaacttcaggcgaccataccaagaatat 

ctatgataatcttcaagccggcagcaaagtaaccctagacagagcttacggacacatgatcatagaagaaggacg 

agaaaatcaggtttggattgctggaggtattgggatcacccccttcatctcttacatccgtgaacatcctattttag 

AT A A AC A GGTTC ACTTCT ACT AT AGCTTCCGTGG AG ATG A A A ATGC AGTCT ACCT AG ATTT ACTCCGT A ACT ATGCT 
CAGAAAAATCCTAATTTTGAACTCCATCTAATCGACAGTACGAAAGACGGCTATCTTAATTTTGAACAAAAAGAAG 
TGCCCGAACATGCAACCGTCTATATGTGTGGTCCTATTTCTATGATGAAGGCACTTGCCAAACAGATTAAGAAACA 
A A ATCC AA A A AC AG A GC AT ATTT AC 

(SEQ ID NO: 58) 

MEFSMKSVKGLLFIIASFILTLLTWMNTSPQFMIPGLALTSI^LTFI^ 

FSMGGLWGSRi^AQFGNLAIYIFASULVAYLGKYIQYEAWRWIHRLVYLAYILGLFHIYMlMGNRLLTFNLLSFLVGSY 
ALLGLLAGFYIIFLYQKISFPYLGKITHLKRIJNIHDTREIQIHI^RPFNYQSGQFAFLKIFQEGFESAPHPFSISGGHGQTLYFT 
VKTSGDHTKNIYDNLQAGSKVTLDRAYGHMIIEEGRENQVWIAGGIGITPFISYtREHPILDKQVHFYYSFRGDENAVYL 
DLLRNYAQKNPNFELHLIDSTKDGYLNFEQKEVPEHATVYMCGPISMMKALAKQIKKQNPKTEHIY 

ID32 900hp 

(SEQ ID NO: 59) 

ATGACTTTTAAATCAGGCTTTGTAGCCATTTTAGGACGTCCCAATGTTGGG 

GGGGCAAAAGATTGCCATCATGAGTGACAAGGCGCAGACAACGCGCAATAAAATCATGGGAATTTACACGACTGA 
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taagoack:aaattgtctttatcgacacaccagggattcacaagcctaaaacagctctcggacatttcat 
tctgcct ac agt acccttcgcg a agtgg ac actgttcttttc atggtgcctgctg atg a agcgcgtggt a aggggg 
acgatatgattatcgagcgtctcaaggctgccaaggttcctgtgattttggtggtgaataaaatcgataaggtcca 
tccagaccagctcntgtctcagattgatgacttccgtaatcaaatggactttaackjaaattgttccaatctcagccc 
5 ttcagggaaataacgtgtctcgtctagtggatattttgagtgaaaatctggatgaaggtttccaatatttcccgtct 
gatcaaatcacagaccatccagaacgtttcttggtttcagaaatggttcgcgagaaagtcttgcacctaactcgtg 
aagagattccgcattctgtagcagtagttgttgactctatgaaacgagacgaagagacagacaaggttcacatccg 
tgcaaccatcatggtcgagcgcgatagccaaaaagggattatcatcggtaaaggtggcck^tatgcttaagaaaat 
cggtagcatggcccgtcgtgatatcgaactcatgctaggagacaaggtcttcctagaaacctgggtcaaggtcaag 
10 aaaaactggcgcgataaaaagctagatttggctgactttggctataatgaaagagaatactaa 

(SEQ ED NO: 60) 

MTFKSGFVAILGRPNVGKSTFIJ^HVMGQKIAIMS 

TIJlEVDTVLmVPADEARGKGDDMIlERLKAAKVPVILVVNKIDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVSR 
15 LVDILSENLDEGFQYFPSDQITDHPERFLVSEMVREKVLHLTREErPHSVAVVVDSMKRDEETDKVHrRATIMVERDSQK 
GIIIGKGGAMLKKIGSMARRDIELMLGDKVFLETWVKVKKNWRDKKLDLADFGYNEREY 

ID33 855bp 

20 (SEQ ID NO: 61) 

ctck:ttcttgttittacagaaggaggacttatgcctgaatt^ 

attgattataggaaagaagatttcgagtatagaaattcgctaccccaagatgattaagacggatttggaagagttt 
c a a aggg a attgcct agtc ag att atc g agtc a atggg acgtcgtgg a a a at atttgc l"l " i " i " i" i atctg ac ag ac a 
aggtcttgatttcccatttgcggatggagggcaagtatttttactatccagaccaaggacctgaacgcaagcatgc 
25 ccatgtttrctttcattttgaagatggtggcacgcttgtttatgaggatg 
tggtgcctgaccttttagacgtctactttatttctaaaaaattaggtcctc 

caggtctttcaatctgcccttgccaagtccaaaaagcctatcaaatcccatctcctagaccagaccttggtagctgg 
acttggcaatatctatgtggatgaggttctctggcgagctcaggttcatccagctagaccttcccagactttgacag 
cagaagaactcgactck:cattcatgaccagaccattgctgttttgggccaggctgttgaaaaaggtggctccaccat 
30 tcggacttataccaatgcctttggggaagatggaack:atgcaggactttcatcaggtctatgataagactggtcaa 
gaatgtgtacck:tgtggtaccatcattgagaaaattcaactaggcggacgtggaacccacttttgtccaaactgtc 
aaaggagggactga 

(SEQ ID NO: 62) 

35 MLLVFTEGGLMPELPEVETVCRGL£KI-HGKKISSIEKYPKJV1IKTDLEEFQRELPSQ11ESMGRRGKYLLFYLTDKVLISHL 
RMEGKYFYYPDQGPERKHAHVFFHFEDGGTLVYEDVRKFGTMELLVPDLLDVYFISKKLGPEPSEQDFDLQVFQSALA 
KSKKPIKSHLLDQTLVAGLGNIYVDEVLWRAQVHPARPSQTLTAEEATAIHDQTIAVLGQAVEKGGSTIRTYTNAFGED 
GSMQDFHQVYDKTGQECVRCGTIIEKIQLGGRGTHFCPNCQRRD 

40 11)34 633bp 

(SEQ ID NO: 63) 

ttgtcc a a actgtc a a agg aggg actg atggg a a a a atc atcg g a atc actggggg a attgcctctggt a agtc a 
actgtgacaaattttctaagacagcaaggctttcaagtagtggatgccgacgcagtcgtccaccaactacagaaac 

45 ctggtggtcgtctgtttg aggctct agt ac agc actttgggc a ag a a atc attcttg a a a acgg ag a actc aatcg 
ccctctcctagctagtctcatcttttcaaatcctgatgaacgagaatggtctaagcaaattcaaggggagattatcc 
gtgagg a actggctactttgag ag aac agttggctc ag ac aga agag attttcttc atgg atattcccct ac iti" it 
gagcaggactacagcgattggtttch^tgagacttggttggtctatgtggaccgagatck:ccaagtggaacgcttaa 
tgaaaagggaccagttgtccaaagatgaacm:tgagtctcgtctgck:agcccagtgck:ctttagaaaaaaagaaag 

50 atttggccagccaggttcttgataataatggcaatcagaaccagcttcttaatcaagtgcatatccttcttgaggga 
ggtaggcaagatgacagagattaa 

(SEQ ID NO: 64) 

MSKX^K£GLMGKIIGrTGGIASGKSTVTNFLRQQGFQVVDADAVVHQLQKPGGRLFEALVQHFGQEIILENGELNRPLLA 
55 SLIFSNPDEREWSKQIQGEIIREELATLREQLAQTEEIFF^DIPLLFEQDYSDWFAETWLVYVDRDAQVERLMKRDQLSK 
DEAESRLAAQWPLEKKKDLASQVLDNNGNQNQLLNQVHILLEGGRQDDRD 

IP35 1269bp 

60 (SEQ ID NO: 65) 

TTGATAATAATGGCAATCAGAACCAGCTTCTTAATCAAGTGCATATCCTTCTTGAGGGAGGTAGGCAAGATGACAG 

agattaactcgaaggataatctgcck:attgcctggtttggtaatt^ 

cctttt atgccc atcttcgtgg a a a atctaggtgtaggg agtc agc a agtcgc 11111 atgc aggctt agc a atttc 
tgtctctgctatttccgcggcc^tcttttctcctatttggggtattcttgctc 
65 gattcgggcaggtcttgctatgactatcactatgggaggcttggcctttgtcccaaatatctattggttaatctttc 
ttcgtttactaaacggtgtatttgcaggttttgttcctaatgcaacgck:actgatagccagtcaggttccaaaggag 
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aaatcaggctctgccttaggtactttgtctacaggcgt^ 

tatcgcagaattatttgck:attcgtacag i'im icttactggttggtag1 tl tctaltltlagctgctaitngactatt 
tgctttatcaaggaagattttcaaccagtack:caach^ 

aatatccctatcttttgctcaatctctttttaaccagttttgtcatccaattttcagctcaa 
5 ggctctttatgtacgcg actt agggc ag ac ag ag a atcttc i" i" l"l " i gtctctggtttg attgtgtcc agt atgggct 
tttccagcatgatgagtgcaggagtcatgggcaagctaggtgacaaggtggck:aatcatcgtctcttggttgtcgc 
ccagttttattcagtcatcatctatctccrctgtck:caatgcctctack:ccccttcaacragga 
ctttggattgggaaccggtgccttgattcccggggttaatc^ 

tcgagggtctttck:cttcaatcaggtattcttttatctgggaggtgttgttggtcccatggcaggttctgc 
10 aggtcaatttggctaccatgctgtcttttatgcgacaagcctttgtgttgcctttagttgtctc 
atttcgaacattattaaaagtaaaggaaatctag 

(SEQ ID NO: 66) 

MIIMAIRTSFLIKCISFTJ^EVGKMTEINWKDNLRIAWFGNFLTGASISLV 
15 AALFSPIWGILADKYGRKPMMIRAGLAMTrrMGGLAFVPNIYWLIFU^LLNGVFAGFVPNATALIASQVPICEKSGSALGT 
LSTGVVAGTLTGPFIGGFIAELFGIRTVF1-LVGSFLFLAAILTICF[K£DFQPVAK£KAIPTKELFTSVKYPYLLLNLFLTSFVI 
QFSAQSIGPILALYVRDLGQTENLLFVSGLIVSSMGFSSMMSAGVMGKLGDKVGNHRLLVVAQFYSVUYLLCANASSPL 
QLGLYRFLFGLGTGALIPGVNALLSKAITPKAGISRVFAFNQVFT^LGGVVGPMAGSAVAGQFGYHAVFYATSLCVAFS 
CLFNLIQFRTLLKVKEI 

20 

ID36 1311 bp 

(SEQ ID NO: 67) 

ATGGCCCT ACC A ACT ATTGCC ATTGT AGG AC GTCCC A ATGTTGGG A A ATC A ACCCT ATTT A ATCGG ATCGCTGGTG 

25 AGCGAATCTCCATTGTAGAAGATGTCGAAGGAGTGACACGTGACCGTATTTATGCAACGGGTGAGTGGCTCAATCG 
TTCTTTT AGC ATG ATTG AT AC AGG AGG A ATTG ATG ATGTCG ATGCTCCTTTC ATGG A AC A A ATC A AGC ACC AGGC A 
GAAATTGCCATGGAAGAAGCAGATGTTATCGTTTTTGTCGTGTCTGGTAAGGAAGGAATTACTGATGCAGACGAAT 
ACGTAGCTCGTAAGCTTTATAAGACCCACAAACCAGTTATCCTCGCAGTCAACAAGGTGGACAACCCTGAGATGAG 
AAATGATATATATGATTTCTATGCTCTCGGTTTGGGTGAACCATTGCCTATCTCATCTGTCCATGGAATCGGTACAG 

30 GGGATGTGCTAGATGCGATCGTAGAAAATCTTCCAAATGAATATGAGGAAGAAAATCCAGATGTCATTAAGTTTAG 
CTTG ATTGGTCGTCCT A AC GTTGG A A A ATC AAGCTTG ATC A ATGCT ATCTTGGG A G A AG ACCGTGTT ATTGCT AGTC 
CTGTTGCTGGAACAACTCGTGATGCCATTGATACCCACTTTACAGATACAGATGGTCAAGAGTTTACCATGATTGAT 
ACGGCTGGT ATGC GT A AGTCTGGT A AGGTTT ATG A A A AT ACTG AG A A AT ACTCTGTT ATGCGTGCC ATGCGTGCT A 
TTGACCGTTCAGATGTGGTCTTGATGGTCATCAATGCGGAAGAAGGCATTCGTGAGTACGACAAGCGTATCGCAGG 

35 ATTTGCCCATGAAGCTGGTAAAGGGATGATTATCGTGGTCAACAAGTGGGATACGCTTGAAAAAGATAACCACACT 
ATGAAAAACTGGGAAGAAGATATCCGTGAGCAGTTCCAATACCTGCCTTACGCACCGATTATCTTTGTATCAGCTT 
TAACCAAGCAACGTCTCCACAAACTTCCTGAGATGATTAAGCAAATCAGCGAAAGTCAAAATACACGTATTCCATC 
A GCTG TCTTGAACGATGTCATCATGGATGCCATTGCCATCAACCCAACACCGACAGACAAAGGAAAACGTCTCAAG 
ATTTTCTATGCGACCCAAGTGGCAACCAAACCACCAACCTTTGTCATCTTTGTCAATGAAGAAGAACTCATGCACTT 

40 TTCTT ACCTGCGTTTCTTGG A A A ATC A A ATCCGC A AGGCCTTTGTTTTTG AGGG A AC ACCG ATTC A TCTC ATCGC AA 

GAAAACGCAAATAA 

(SEQ ID NO: 68) 

MALPTLAIVGRPNVGKSTLFNRIAGERISIVEDVEGVTRDRrYATGEWLNRSFSMIDTGGIDDVDAPFMEQIKHQAEIAME 
45 EADVIVFVVSGKEGITDADEYVARKLYKTHKPVILAVNKVDNPEMRNDIYDFYALGLGEPLPISSVHGIGTGDVLDAIVE 

NLPNEYEEENPDVIKFSLIGRPNVGKSSLINAILGEDRVIASPVAGTTRDAIDTHFTDTDGQEFTMIDTAGMRKSGKVYEN 

TEKYSVMRAMRAIDRSDVVUVIVINAEEGIREYDKJUAGFAHEAGKGMIIVVNKWDTI^ 

LPYAPIIFVSALTKQRLHIO^EMIKQISESQNTRIPSAVLNDVIMDAIAINPT^ 
^ ELMHFSYLRFLENQIRKAFVFEGTPIHLIARKRK 

1P37 714bp 

(SEQ ID NO: 69) 

atgacagaaaccattaaattgatgaaggctcatacttcagtgcgcaggtttaaagagcaagaaattccccaagta 
55 gacttaaatgagattttgacagcagcccagatgck^atcatcttggaagaatttccaatcctactctgtgattgtggt 
acgaagtcaagagaagaaagatgccttgtatgaattggtacctcaagaagccattcgccagtctgctgttttcctt 
ctctttgtcggagatttgaaccgagcagaaaagggagcccgacttcataccgacaccttccaaccccaaggtgtgg 
aaggtctcttgattagttcggtcgatgcagctcttck:tggacaaaacgccttgttggcagctgaaagcttgggctat 
ggtggtgtgattatcggtttggttcgatacaagtctgaagaagtggcagagctctttaacctacctgactacacct 
60 attctgtctttgggatggcactgggtgtgccaaatcaacatcatgatatgaaaccgagactgccactagagaatgt 
tgtctttgaggaagaataccaagaacagtcaactgaggcaatccaagcttatgaccgtgttcaggctgact atgct 
ggggcgcgtgcgaccacaagctggagtcagcgcctagcagaacagtttggtcaagctgaaccaagctcaactaga 
aaaaatcttgaacagaagaaattattgtag 

65 (SEQ ID NO: 70) 
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MTETIKLMKj\HTSVRRFTCEQEIPQVDLNEILTAAQMASSWKJSIFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFVG 
DLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFGMA 
LGVPNQHHDMIO^RLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRIJVEQFGQAEPSSTRK^IXQKKJLL 

ID38 729bp 

(SEQ ID NO: 71) 

atgacagaaattagactagagcacgtcagttatgcctatggtcaggagaggattttagaggatatcaacctacag 

gtg actt caggcgaagtggtttccatcctaggcccaagtggtgttggaaagaccaccctctttaatctaatcgctg 

ggattttagaagttcagtcackjgagaattgtccttgatggtgaagaaaatcccaaggggcgcgtgagttatatg 

gcaaaaggatctgctcttggagcacaagacggtgcttggaaatatcattctgcccctcrtgattcaaaaggtggat 

aaggcagaagctatttcccgagcggataaaattcttgcgaccttccagctgacagctgtaagagacaagtatcctc 

atga acttagcggtggg atgcgccagcgtgtagccttactccgg acctaccim'imm gggcacaagctctttctctta 

gatgaggccittack:gccttggatgagatgacaaagatggaactccacgcttggtatcttgagattcacaagcagt 

tgcagctaacaaccctgatcatcacgcatagtattgaggaggccctcaatctcagcgaccgtatctatatcttgaa 

aaatcgccctgggcagattgtttcagaaattaaactagattggtctgaagatgaggacaaggaagtccaaaagat 

tgcctacaaacgtcaaattttggcggaattaggcttagataagtag 

(SEQ [D NO: 72) 

MTEmLEHVSYAYGQERII^DINLQVTSGEVVSILGPSGVGKTTLFNLIAGILEVQSGRIVLDGEENPKGRVSYMLQICDLL 
LEHKTVLGNIILPLLIQKVDKAEAISRADKILATFQLTAVRDKYPHELSGGMRQRVALLRTYLFGHKLFLLDEAFSALDE 
MTKMELHAWYLEIHKQLQLTTLIITHSIEEALNI^DRIYILKNRPGQIVSEIKLDWSEDEDKEVQKIAYKRQILAELGLDK 

ID39 2433bp 

(SEQ ID NO: 73) 

atgaactattcaaaagcattgaatgaatgtatcgaaagtgcctacatggttgctggacattttggachirrcgttatct 

ag agtcgtggc acttgttg attgcc atgtct a atc ac a gtt at a gtgt agc aggggc a acttt a a atg att atccgt 

atgagatggaccgtttagaagaggtggctttggaactgactgaaacggactatagccaggatgaaacctttacgg 

aattgccgttctcccgtcgtttgcaggttctttttgatgaagcagagtatgtagcgtcagtggtccatgctaaggta 

ctagggacagack:acgtcctctatgcgattttgcatgatagcaatgccttggcgactcgtatcttggagagggctg 

gtttttcttatgaagacaagaaagatcaggtcaagattgctgctcttcgtcgaaatttagaagaacgggcaggctg 

gactcgtgaagatctcaaggctttacgccaacgccatcgtacagtagctgacaagcaaaattctatggccaatatg 

atgggcatgccgcagactcctagtggtggtctcgaggattatacgcatgatttgacagagcaaccgcgttctggca 

agttagaaccagtcatcggtcgggacaaggaaatctcacgtatgattcaaatcttgagccggaagactaagaaca 

accctgtcttggttggggatgctggtgtcgggaaaacagctctggcgcttggtcttgcccagcgtattgctagtggt 

gacgtgcctck:ggaaatggctaagatgcck:gtgttagaacttgatttgatgaatgtcgttgcagggacacgcttcc 

gtggtgactttgaagaacgcatgaataatatcatcaaggatattgaagaagatggccaagtcatcctctttatcga 

tgaactccacaccatcatgggttctggtagcgggattgattcgactctggatgcggccaatatcttgaaaccagcc 

ttggcgcgtggaactttgagaacggttggtgccactactcaggaagaatatcaaaaacatatcgaaaaagatgcg 

gcactttctcgtcgtttcckztaaagtgacgattgaagaaccaagtc^ 

gaaggcgacttatgagaaacatcaccgtgtacaaatcacagatgaagcggttgaaacagcggttaagatggctca 

tcgttatttaaccagtcgtcacttgccagactctgctatcgatctcttggatgaggcggcagcaacagtgcaaaat 

aaggcaaagcatgtaaaagcagacgattcagatttgagtccagctgacaaggccctgatggatggcaagtggaaa 

caggcagcccagctaatcgcaaaagaagaggaagtacctgtctacaaagacttggtgacagagtctgatattttg 

accaccttgagtcgcttgtcaggaatcccagttcaaaaactgactcaaacggatgctaagaagtatttaaatcttg 

aagcagaactccataaacgggttatcggtcaagatcaagctgtttcaagcattagccgtgccattcgccgcaacca 

gtcagggattcgcagtcataagcgtccgattggttcctttatgttcctagggcctacaggtgtcgggaaaactgaa 

ttagccaaggctctggcagaagttctttttgacgacgaatcagcccttatccgctttgatatgagtgagtatatgga 

gaaatttgcagctagtcgtctcaacggagctcctccaggctatgtaggatatgaagaaggtggggagttgacaga 

gaaggttcgcaataaaccctattccgttctcctctttgatgaggtagagaaggcccacccagatatctttaatgttc 

tcttgcaggttctggatgacggtgtcttgacagatagcaagggacgcaaggtcgatttttcaaataccattatcatt 

atgacatcgaatctaggtgcgactgcccttcgtgatgataagactgttggttttggggctaaggatattcgttttga 

ccaggaaaatatggaaaaacgcatgtttgaagaactgaaaaaagcttatagaccggaattcatcaaccgtattga 

tg ag a aggtggtcttcc at agcct atct a gtg atc at atgc agg a agtggtg a ag att atggtc a agccttt agtg 

gc a agtttg actg a a aa aggc a ttgacttg a a attac a agcttcagctctg a aattgtt agc a a atcaagg at atg 

acccagagatgggagctcck:ccacttcgcagaaccctgcaaacagaagtggaggacaagttggcagaacttcttc 

tcaagggagatttagtggcaggcagcacacttaagattggtgtcaaagcaggccagttaaaatttgatattgcata 

A 

(SEQ ID NO: 74) 

MNYSKj\LNEC£ESAYMVAGHFGARYL£SWHLLIAMSNHSYSVAGATLNDYPYEMDRLEEVALELTETDYSQDETFTEL 
PFSRRLQVLFDEAEYVASVVHAKVLGTEHVLYAILHDSNALATRILERAGFSYEDKKDQVKIAALRRNLEERAGWTRED 
LIO\LRQRHRTVADKQNSMANMMGMPQTPSGGLEDYTHDLTEQARSGKi^PVIGRDK£ISRMIQILSRKTKNNPVLVGD 
AGVGKTALALGLAQRIASGDVPAEMAKJvlRVLELDLMNVVAGTRFRGDFEERMNNIIKDlEEDGQVILFIDELHTIMGSG 
SGIDSTLDAANILKPALARGTLRTVGATTQEEYQKHIEKDAALSRRFAKVTIEEPSVADSMTILQGLKATYEKHHRVQIT 
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DEAVETAVKMAHRYLTSRHLPDSAIDLLDEAAATVQNKAKHVKADDSDLSPADKALMDGKWKQAAQLIAKEEEVPV 

YKI)LVTESDILTTl^RLSGIPVQKLTQTDAKKYLNl^AE!^KRVIGQDQAVSSISRAmRNQSGIRSHKRPIGSFMFLGPTG 

VGKTELAIOVLAEVLroDESALlRFDMSEYMEKFAASRLNGAPPGYVGYEEGGELTEKVRNKPYSVLLFDEVEKAHPDIF 

NVIXQVLDDGVLTDSKGRKVDFS^IIIMTSNLGATALRDDKTVGFGAKDIRFDQENMEKJ^MFEELKKAYRPEFI^ 

KVVFHSI^SDHMQEVVKIMVKPLVASLTEKGIDLKLQASALKIXANQGYDPEMGARPLRRTLQTEVEDKLAELLLKGD 

LVAGSTLKIGVKAGQLKFDIA 

ID40 1008bp 

(SEQ ID NO: 75) 

atgaagaaaacatggaaagtgtttttaacgcntgtaacagctct 

g a actgcttct a a a g ac a ac a a ag aggc ag aactt a ag a a ggttg acttt atcct ag actgg ac acc a a at acc a 
accacacagggctttatgttck:caaggaaaaaggttatttcaaagaagctggagt^ 

accagaagaaagttcttctgacttggttatcaacch3aaaggcaccatttgcagtgtatttccaagactacatggct 

aagaaattggaaaaaggagcaggaatcactgccgttck:agctattgttgaacacaatacatcaggaatcatctctc 

gtaaatctgataatgtaagcagtccaaaagacttggttggtaagaaatatgggacatggaatgacccaactgaac 

ttgctatgttgaaaaccttggtagaatctcaaggtggagacittgagaaggttgaaaaagtaccaaataacgactc 

aaactcaatcacaccgattck:caatckjcgtctttgatactgcttggatttactacggttgggatggtatcctt 

aatctcaaggtgtagatgctaacrrcatgtacttgaaagactatgtcaaggagtttgactactattcaccagttatc 

atcgcaaacaacgactatctgaaagataacaaagaagaagctcgcaaagtcatccaagccatcaaaaaaggctac 

caatatgccatggaacatccagaagaagctgcagatattctcatcaagaatgcacctgaactcaaggaaaaacgt 

gactttgtcatcgaatctcaaaaatacttgtcaaaagaatacgcaagcgacaaggaaaaatggggtcaatttgac 

gcagctcgctggaatgctttctacaaatgggataaagaaaatggtatccttaaagaagacttgacagacaaaggc 

ttcaccaacgaatttgtgaaataa 

(SEQ ID NO: 76) 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNK£AELKKVDFILD^PNTNHTGLYVAKEKGYFKEAGVDVDLKI-P 

PEESSSDLVINGKAPFAVYFQDYMAKK1£KGAGITAVAAIVEHNTSGIISRKSDNVSSPKI)LVGICKYGTWNDPTELAML 

KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDG1LAKSQGVDANFMYLKDYVKEFDYYSPVIIANND 

YLICDNK£EARKVIQAIKKGYQYAMEHPEEAADILIKNAPELK£KRDFVlESQKYLSKEYASDK£KWGQroAARWNAFY 

KWDKENGILKEDLTDKGFTNEFVK 

ID41 762bp 

(SEQ ID NO: 77) 

ttgatgagaaacttgagaagtatactgagacgacacattagtctattgggctttctcggagtattgtcaatctggc 

agttagcaggttttcttaaacttctccccaagtttatcctgccgacacctcttgaaattctccagccctttgttcgtg 

ac ag ag a atttctctggc acc at a gctgggcg accttg a g agtggcttt actggggctg attttggg agttttg att 

gcctgtcttatggctgtgctcatggatagtttgacttggctcaatgacctgatttaccctatgatggtggtcattca 

gaccattccgaccattgccatagctcctatcctggtcttgtggctaggttatgggattttgcccaagattgtcttga 

ttatcttaacgacaacctttcccatcatcgttagtattttggacggtt^ 

ttgttt agtctg atgcgggcc aagccttggc a a atcctgtggc atttt a a a atccc agtt agcctgcctt actttt a 
tgcaggtctgagggtcagtgtctcctacgcctttatcacaactgtggtatctgagtggttgggaggttttgaaggtc 
ttggtgttt at atg attc agtct a a a a a actgtttc agt a tg at acc atgtttgcc att att attctggtgtcg att a 
tcagtcttttgggtatgaagctggtcgatatcagtgaaaaatatgtgattaaatggaaacgttcgtag 

(SEQ ID NO: 78) 

MMRNLRSILRRHISLLGFLGVI^IWQLAGFLKLLPKFILPTPI^ILQPFVRDREFLVmHSWATLRVALLGLILGVLIACLM 
AVLMDSLTWLNDLIYPMMVVIQTIPTIAIAPUAO.WLGYGILPKI^ 

QILWHFKIPVSLPYFYAGLRVSVSYAFITTVVSEWLGGFEGLGVYMIQSKKJ.FQYDTMFAIIILVSIISLLGMKLVD1SEKY 
VIKWKRS 

ID42 372bp 

(SEQ ID NO: 79) 

ttgatttttaatcctatttgctgtatgataagggaaaagaaaggggacagagatatgck:ttttaccaatacccaca 
tgcg atctgct agttttggt attgtt acc agcttgcctg atg ac a tc attg actc i 1 i 1 i ggtatatc atcgaccatt 
tcttaaaaaatgtctttgaattggaagaagaactcgagtttcaattgcttaataaccaaggaaagattaccttcca 
cttttcaagtcaacacctccctacagccattgattttgactttaaccatccntt 
actggttttagacatggacggtagagaaactatcctcctcccagaagaaaatgacctattttaa 

(SEQ ID NO: 80) 

MIFNPICCMIREKKGDRDMAFI7SITHMRSASFGIVTSLPDDIIDSFWYIIDH 
LPTAIDFDFNHPFDPRYPPRVLVLDMDGRETILLPEENDLF 



IP43 1569bp 
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(SEQ ID NO: 81) 

acagcggtgtcattctatctattttaagaaaagtaataatcaattgttaaaaatagtaaaaaaattggaggttctc 
atgaaatattttgttcctaatgaggtattcagtattcgtaaattaaaggtggggacttgctcggtactattggcaat 
ttcaattttgggaack:caaggtattttatcggatgaagttgttactagttcitcaccgatg^ 
ctaatgcaattactaatgatttagataattcaccaactgttaatcagaa 

tcaaccactaatggtttagataattcgttaagtgttaatagcatcagctctaatggtactattcgttccaattcaca 

attagacaacagaacagttgaatctacagtaacatctactaatgaaaataagagttataaggaagatgttataag 

tgacagaattatcaaaaaagaatttgaagatactgctttaagtgtaaaagattatggtgcagtaggtgatgggatt 

catgatgatcgacaagcaattcaagatgcaatagatgctgcagctcaagggctaggtggaggaaatgtatattttc 

ctgaaggaacttatttagtaaaagaaattgtttttttaaaaagtcatacacacttagaattgaatgagaaagctac 

aattctaaatggtataaatattaagaatcacccttccattgtttttatgacaggtttatttacggatgatggtgcgc 

aagtagaatggggcccaacagaagatattagttattctggtggtacgattgatatgaacggtgctttgaatgaaga 

aggaactaaagcaaaaaatctaccacttataaattcttcaggtck:atttck:tattgggaattcaaataacgtaact 

ataaaaaatgtaacattcaaggatagttatcaaggck:atck:tattcaaattgcaggttcgaaaaatgtattagttg 

ATAATTCTCGTTTTCTTGGGCAAGCCTTACCCAAAACGATGAACKjATGGGCAAATCATAAGTAAGGAGAGCATT 

gattgaaccattaactagaaaaggttttccttatgccitgaatgatgatgggaaaaaatctgaaaatgtgactatt 

caaaattcctattttggcaaaagtgataaatctgck}gaattagtaacagcaattggcacacactatcaaacattgt 

cgacacagaacccctctaatattaaaattcaaaataatcattttgataacatgatgtatgcaggtgtacgttttaca 

ggattcactgatgtattaatcaaaggaaatcgctttgataagaaagttaaaggagagagtgtacattatcgagaa 

agcggagcagctttagtaaatgcttatagctataaaaacactaaagacctattagatttaaataaacaggtggtta 

tcgccgaaaatatatttaatattgccgatcctaaaacaaaagcgatacgagttgcaaaagatagtgcagaatgttt 

aggaaaagtatcagatattactgtaacaaaaaatgtaattaataataattctaaggaaacagaacaaccaaatat 

tgaattattacgagttagtgataatttagtagtctcagagaatagt 

(SEQ ID NO: 82) 

QRCHSIYFKKSNNQLLKIVKKIXVLMKYFVPNEVFSIRKI.KVGTCSVLLAISILGSQG^DEVVTSSSPMATFCESSNArrN 

DU^NSPTVNQNRSAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDT 

ALSVKDYGAVGDGIHDDRQAIQDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVF 

MTGLFTDDGAQVEWGPTEDISYSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAG 

SKNVLVDNSRF^GQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHYQT 

LSTQNPSNIKIQNNHroNMMYAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVI 

AENIFNIADPKTKAIRVAKDSAECLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENS 

ID44 324bp 

(SEQ ID NO: 83) 

GTGATGAAAGAAACTCAGCTATTAAAAGGTGTTCTTGAAGGTTGTGTCTTGGATATGATTGGTCAAAAAGAGCGGT 
ATGGTTATGAGTTGGTTCAGACTTTGCGAGAGGCTGGATTTGATACTATCGTTCCAGGAACTATTTATCCTTTGTTG 
CAAAAGTTAGAAAAAAATCAATGGATAAGAGGCGACATGCGCCCGTCGCCAGATGGTCCAGATCGGAAGTATTTT 
TCATTAATGAAAGAAGGAGAAGAGCGTGTCTCAGTCTTTTGGCAACAATGGGACGATTTGAGTCAAAAAGTAGAA 
GGGATTAAGAATGGGGGTTAA 

(SEQ ID NO: 84) 

MMKETQLLKGVLEGCVLDMIGQKERYGYELVQTLREAGFDTIVPGTIYPLLQKLEKNQWIRGDMRPSPDGPDRKYFSL 
MKEGEERVSVFWQQWDDLSQKVEGIKNGG 

1D45 816bp 

(SEQ ID NO: 85) 

atgaagaaaatgaagtattacgaagaaacaack:gctttck:tacatgagttttctgaggagaatcaaaagtatttt 

aggagttgtgggaaagttttaatcttgctggatttctctatgatgaagactatctcagagagcagatctatttgatg 

atgctagatttctcagaagcagaacgagatggcatgagtgcagaggattatctaggtaagaatcctaaaaaaata 

atgaaagagattctcaagggagcacctcgcagttctatcaaagagtcccttttgacgccaattcttgtcctggcgg 

t attacgtt att atc a act act a agtg atttttct a a aggtcctctctt a ac agtc a atttgctc ac a r 11 " ii agggc 

aacttcntatttttctgattggatttggacttgtggccacaattttacgaagaagtttagtccaag 

atgaaaattggcacttacattgttgttgggactatagttcttctagttgttttaggatatgtaggaatggcaagctt 

catacaagaaggagccttttatattccckxtccctgggatagtttc 

tttggaattggaaagaack:ggtctttcgtccatttgtcagtatgattat^^ 

gttattatgagtggatgggaatttcaaatgttttccttacaaaagttattcct^ 

ttgtcttgttccgtgggtttaagaagataaaatggagtgaagtatag 

(SEQ ID NO: 86) 

MKKMKYYEETSALLHEFSEENQKYFEELWESFNLAGFLYDEDYLREQrfUvlMLDFSEAERDGMSAEDYLGK>JPKKIM 
KEILKGAPRSSIKESLLTPILVLAVLRYYQLI^DFSKGPLLTVNLLTFLGQLLIFLIGFGLVATIU^RSLVQDSPKMKIGTYIV 
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VGTIVLLVVLGYVGMASFIQEGAFYIPAP\\T>SI^VFTISLVIGIWNWKEAVFRPF^ 
LTKVIPLAVLFIGIFVLFRGFKKIKWSEV 

11)46 348bp 

(SEQ ID NO: 87) 

ctgtttttttatttatactcaatgaaaatcaaagagcaaacta 

tgaggttgtagacgaaactgacgaagtcagctcaaaacatgtttttgackjttgtagatgaaactgacgaagtca^ 
tcaaaacactgttttgacm3ttgtagatgaaactgacgaagtcagcrcaaaacactgttttgaggttgtagatgaaa 
ctgacgaagtcack:tcaaaacatgtttttgaggttgtagatgaaactgacgaagtcagtaaccatacatacggtag 
gck:gacgctgacgtck3tttgaagagattttcgaagagtattaa 

(SEQ ID NO: 88) 

MFWLYSMKIKEQTRKI.AAGCSKHCFEVVDETDEVSSKKVFEVVDETDEVSSKHCFEVVDETDEVSSKHCFEVVDETDE 
VSSKHVFEVVDETDEVSNHTYGRATLTWFEEIFEEY 

ID47 1260bp 

(SEQ ID NO: 89) 

atgcagaatctgaaatttgccttttcatctatcatggctc 

atcggtgtttcatcagttgttgtgattatggctttgggtgattccctatctcgtcaagtcaataaagatatgactaa 

atctcagaaaaatattagcgtctttttctctcctaaaaaaagtaaagacgggtcttttactcagaaacaatcagctt 

ttacggtttctggaaaggaagaggaagttcctgttgaaccgccaaaaccgcaagaatcctgggtccaagaggcag 

ctaaactgaagggagtggatagttactatgtaaccaattcaacgaatgccatcttgacctatcaagataaaaaggt 

tgagaatck:taatttgacaggtggaaacagaacttacatggacgctgttaagaatgaaattattgcaggtcgtagt 

ctgagagagcaagatttcaaagagtttgcaagtgtcattttgctagatgaggaattgtccattagtttatttgaatc 

tcctcaagaggctattaacaaggttgtagaagtcaatggatttagttaccgggtcattggggtttatactagtccg 

GAGCCTAAAAGATCAAAAATATATCKjGTTTGGTCKjCTTC^^ 

tgtagatgaaatagctaatattgtctttcgagtgaatgataccagtttaaccccaactctgggtccagaactggca 
cgaaaaatgacagagcttgcaggcttacaacagggagaataccaggtggcagatgagtccgttgtatttgcagaa 
attcaacaatcgtttagttttatgacgacgattattagttccatc 

tgtcatgaacatcatgctggtttcggtgacagagcgcactcgtgagattggtcttcgtaaggctttgggtgcaaca 

cgtgccaatattttaattcagtttttgattgaatccatgattttgaccttgttaggtggcttaatt 

gcaagtggtttaactgccttagcaggtttgttactgcaaggtttaatagaaggtatagaagttggagtatcaatcc 

cagtcck:cctatttagtcttck:agtttcggctagtgttggtatgatttttggagtcttgccagccaaca 

aaacttgatccaattgaagcccttcgttatgaatga 

(SEQ ID NO: 90) 

MQNLKFAFSSIMAHKMRSLLTMIGIHGVSSVVV[MALGDS1^RQVNKJ)MTKSQKNISVFFSPKKSKDGSFTQKQSAFTVS 

GKEEEVPVEPPKPQESWVQEAAKLKGVDSYYVTNSTNA1LTYQDKKVENANLTGGNRTYMDAVKNEIIAGRSLREQDF 

KEFASVILLDEELSISLFESPQEAINKVVEVNGFSYRVIGVYTSPEAKRSKIYGFGGLPITTNISLAANFNVDEIANIVFRVN 

DTSLTPTLGPELARK^TELAGLQQGEYQVADESVVFAEIQQSFSFMTTIISSIAGISLFVGGTGVMNIMLVSVTERTREIGL 

RKALGATRANILIQFLIESMILTLLGGLIGLTIASGLTALAGLLLQGLIEGIEVGVSIPVALFSLAVSASVGMIFGVLPANKA 

SKLDPIEALRYE 

ID48 705bp 

(SEQ ID NO: 91) 

ctgatgaagcaactaattagtctaaaaaatatcttcagaagttaccgtaatggtgaccaagaactgcaggttctca 

aaaatatcaatctagaagtgaatgagggtgaatttgtagccatcatgggaccatctgggtctggtaagtccactct 

gatgaatacgattgck:atgttggatacaccaaccagtggagaatattatcttgaaggtcaagaagtggctgggctt 

ggtgaaaaacaactagctaaggtccgtaaccaacaaatcggttttgtctttcagcagttctttcttctatcgaagct 

caatgctctgcaaaatgtagaattgcccttgatttacgcaggagtttcgtcttcaaaacgtcgcaagttggctgag 

gaatatttagacaach3ttgaattgacagaacgtagtcaccatttaccttcagaattatctggtggtcaaaagcaac 

gtgtagccattgcgcgtck:cttggtaaacaatccttctattatcctagcggatgaaccgacaggagccttggatac 

caaaacaggtaaccaaattatgcaattattggttgatttgaataaagaaggaaaaaccattatcatggtaacgcat 

gagcctgagattgctck:ctatgccaaacgtcagattgtcattcgggatck3ggtcatttcgtctgacagtgctcagtt 

aggaaaggaggaaaactaa 

(SEQ ID NO: 92) 

MMKQUSLKNIFRSYRNGDQELQVLKNINLEVNEGEFVAIMGPSGSGKST1JV1NTIGMLDTPTSGEYYLEGQEVAGLGEK 
QLAKVRNQQIGFVFQQFFLLSKLNALQNVELPLIYAGVSSSKRRKLAEEYLDKVELTERSHHLPSELSGGQKQRVAIARA 
LVNNPSIILADEPTGALDTKTGNQIMQLLVDLNKEGKTIIMVTHEPEIAAYAKRQIVIRDGVISSDSAQLGKEEN 



ID49 1200bp 
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(SEQ ID NO: 93) 

atg a ag a a a a ag a atggt a aa gct a a a a agtggc aactgt atgc agc a atccgtgctgcg agtgt agttgt attg 
ggtgctggggggattttactcittagacaaccttctcagactgctctaa 
caaggaaggaagcgtggcctcctctgttttattgtcagggacagtaacack:a^ 
gatgctagtaagggtgatttagatgaaatccttgttt^ 

agtacagtagttcagaagcgcaggcckjcctatgattcagctagtcgagcagtagctagggcagatcgtcatatca 

atgaactcaatcaagcacgaaatgaagccgcttcagctccggctccacagttaccagcgccagtaggaggagaag 

atgcaacggtgcaaagcccaactccagtggctggaaattctgttgcttctattgacgctcaattgggtgatgcccg 

tgatgcgcgtgcagatgctgcggcgcaattaagcaaggctcaaagtcaattggatgcaacaactgttctcagtacc 

ctagagggaactgtggtcgaagtcaatagcaatgtttctaaatctccaacaggggcgagtcaagttatggttcata 

ttgtcagcaatgaaaatttacaagtcaagggagaattgtctgagtacaatctagccaacctttctgtaggtcaaga 

agt a agcttt acttct a a agtgt atcctg at a a a a aatgg actggg a a att a agct at atttctg act atcct a a a 

aacaatggtgaaccagctagtccagcackzcgggaataatacaggttctaaatacccttatactattgatgtgacag 

cx:gaggttggtgatttgaaacaaggtttttctgtcaacattgaggt^ 

gttagcagtctagtaatggatgatagtaaaaattatgtctggattgtggatgaacaacaaaaggctaaaaaagtt 

gaggtttcattgggaaatck:tgacgcagaaaatcaagaaatcacttctggtttaacgaacggtgctaaggtcatca 

gtaatccaacatcttccttggaagaaggaaaagaggtgaaggctgatgaagcaactaattag 

(SEQ ID NO: 94) 

MKKKisfCKAKKWQLYAAIGAASVVVLGAGGIIJ^FRQPS 

ASKGDLDEILVSVGDKVSEGQALVKYSSSEAQAAYDSASRAVARADRHINELNQARNEAASAPAPQLPAPVGGEDATV 

QSPTPVAGNSVASIDAQLGDARDARADAAAQLSKAQSQLDATTVLSTLEGTVVEVNSNVSKSPTGASQVMVHrVSNEN 

LQVKGEL^EYNLANI^VGQEVSFTSKVYPDKKWTGICLSYISDYPKNNGEAASPAAGNNTGSKYPYTIDVTGEVGDLKQ 

GFSVNIEVKSKTKAILVPVSSLVMDDSKNYVWIVDEQQKAKKVEVSLGNADAENQE1TSGLTNGAKVISNPTSSLEEGKE 

VKADEATN 

IPSO 759bp 

(SEQ ID NO: 95) 

atgtcacgtaaaccatttatcgctggtaactggaaaatgaacaaaaatccagaagaagctaaagcattcgttgaa 

gc agttgc atc a a a acttccttc atc ag atcttgttg a agc aggt atcgctgctcc agctcttg atttg ac a actgt 

tcttgctgttgc a a a aggctc a a acctt aa agttgctgctc a a a actgct actttg a a a atgc aggtgctttc actg 

gtgaaactagcccacaagttttgaaagaaatcggtactgactacgttgttatcggtcactcagaacgccgtgacta 

cttccatgaaactgatgaagatatcaacaaaaaagcaaaagcaatctttgcgaacggtatgcttccaatcatctgt 

tgtggtg a atc acttg a a actt acg a agctggt a a a gctgctg a attcgt aggtgctc a agt atctgctgc attg g 

ctggattgactgctgaacaagttgctgcctcagttatcgcttatgagccaatctgggctatcggtactggtaaatca 

ck:ttcacaagacgatck:acaaaaaatgtgtaaagttgttcgtgacgttgtagctgctgactttckjTCaagaagtcg 

cagacaaagttcgtgttcaatacggtggttctgttaaacctgaaaatgttgcttcatacatggcttgcccagacgtt 

gacggtgcccttgtaggtggtgcgtcacttgaactctgaaagcttcttggctttgcttgactttgtaaaataa 

(SEQ ID NO: 96) 

MSRKPFIAGNWKMNKNPEEAKj\FVEAVASKLPSSDLVEAGIAAPALDLTTVLAVAKGSNLKVAAQNCYFENAGAFTGE 
TSPQVLK£IGTDYVVIGHSERRDYFHETDEDINKKAKAIFANGMLPIICCGESLETYEAGKAAEFVGAQVSAALAGLTAE 
QVAASVIAYEPIWAIGTGKSASQDDAQKMCKVVRDVVAADFGQEVADKVRVQYGGSVICPENVASYMACPDVDGALV 
GG ASLEAES FLALLDFV K 

IP51 1473bp 

(SEQ ID NO: 97) 

ttgaaaacaaaaattggattagcaagtatctgtttactagck:ttggcaactagtcatgtcgctgcaaatgaaactg 

aagtagcaaaaacttcgcaggatacaacgacagcttcaagtagttcagagcaaaatcagtcttctaataaaacgc 

aaacgagcgcagaagtacagactaatgctgctgcccactgggatggggattattatgtaaaggatgatggttcta 

aagctcaaagtgaatggatttttgacaactactataaggcttggttttatattaattcagatggtcgttactcgcag 

aatgaatggcatggaaattactacctgaaatcaggtggatatatggcccaaaacgagtggatctatgacagtaatt 

acaagagttggttttatcrcaagtcagatggggcttatgctcatcaagaatggcaattgattggaaataagtggta 

ctacttcaagaagtggggttacatggctaaaagccaatggcaaggaagttatttcttgaatggtcaaggagctatg 

atgcaaaatgaatgck:tctatgatccagcctattctgcttatttttatctaaaatccgatggaacttatgctaacca 

agagtggcaaaaagtgggcggcaaatggtactatttcaagaagtggggctatatggctcggaatgagtggcaagg 

caactactatttgactggaagtggtgccatggcgactgacgaagtgattatggatggtactcgctatatctt^ 

gcctctggtgack:tcaaagaaaaaaaagatttgaatgtcgck:tgggttcacagagat^ 

ataatagagaagaacaagtgggaaccgaacatgctaagaaagtcattgatattagtgagcacaatggtcgtatca 

atgattggaaaaaggttattgatgagaacgaagtggatggtgtcattgttcgtctaggttatagcggtaaagaaga 

caaggaattggcckiataacattaaggagttaaaccgtctgggaattccttatckjtgtctatctctatacctatgctg 

aaaatgagaccgtgctgagagtgacgctaaacagaccattgaacttataaagaaatacaatatgaacctgtctta 

ccctatctattatgatgttgagaattgggaatatgtaaataagagcaagagagctccaagtgatacaggcacttgg 

gttaaaatcatcaacaagtacatggacacgatgaagcaggcgggttatcaaaatgtgtatgtctatagctatcgta 
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GTTTATTACAGACCTCGTTTAAAACACCCAGATATTTTA^ 

AGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGAAAGGAATCCA 
AGGGCGCGTAGATGTCAGCGTTTGGTATTAA 

(SEQ ID NO: 98) 

MKTKIGLASICLUjLATSHVAANETEVAKTSQDTTTASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQ 

SEWIFDNYYKAWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYY 

FKKWGYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVCGKWYYFKKWGYMARNE 

WQGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVIDISEHNGR 

fNDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAESDAKQTIELIKKYNMNLSYPI 

YYDVENWEYVNKSKRAPSDTGTWVKIINKYMDTM KQAGYQNVYVYSYRSLLQTRLKHPDILKHVNWVAAYTNALE 

WENPHYSGKKGWQYTSSEYMKGIQGRVDVSVWY 

ID52 774bp 

(SEQ ID NO: 99) 

ATGAAAAAATTTGCCAACCTTTATCTGGGACTGGTCTTTCTGGTCCTCTACCTGCCTATCTTTTACTTGATTGGCTAT 

GCCTTTAATGCTGGTGATGATATGAATAGCTTTACAGGTTTTAGCTGGACTCACTTTGAAACCATGTTTG 

G AG ACTC ATGCTG ATTTTGGCTC AG AC A I " 11 T 1 CTTGGCCTTCCTATC AGCCTTG ATAGCG ACC ATTATCGGG ACTTT 

TGGTCK^CATTTACATCTACCAGTCTCGTAAGAAATACCAAGAAGCCTTTCTATCACTCAATAATATCCTCATGGTTG 

CGCCTGACGTTATGATTGGTGCTAGCTTCTTGATTCTCTTTACCCAACTCAAGTTTTCACTTGGCIMMMM'GACCGTT^ 

TATCTAGTCACGTGGCCTTCTCCATTCCTATCGTGGTCTTGATGGTCTTGCCTCGACTCAAGGAAATGAATGGCGAC 

ATGATTCATGCGGCCTATGACTTGGGAGCTAGTCAATTTCAGATGTTCAAGGAAATCATGCTTCCTTACCTGACTCC 

GTCTATCATTACTGGTTATTTCATGGCCTTCACCTATTCGTTAGATGACITTGCCGTGA 

TGGCTTTTCAACCCTATCAGTCGAGATTTACTCTCGTGCTCGCAAGGGGATTTCCTTAGAAATCAATGCCCTGTCTG 
CTCTAGTCTTTCTCTTTAGTATTATCCTAGTTGTAGGTTATTACTTTATCTCTCGTGAGAAGGAGGAGCAAGCATGA 

(SEQ ID NO: 100) 

MKKFANLYLGLVFLVLYLPIFYLIGYAFNAGDDMNSFTGFSWTHFET^FGDGRLMLILAQTFFLAFLSALIATIIGTFGAI 

Yr^QSRKKYQEAFLSlJMNILMVAPDVMIGASFLlLFTQLKFSLGFLTVLSSHVAFSIPIVVLJVIVLPRLKEMNGDMIHAA 

LGASQFQMFKJEIMLPYLTPSIITGYFMAFTYSLDDFAVTFWTGNGFSTI^ 

YYFISREKEEQA 

ID59 1071bp 

(SEQ ID NO: 101) 

ATG A A A A A A ATCT ATTC A I " I" 1 " I" I AGC AGG A ATTGC AGCG ATT ATCCTTGTCTTGTGGGG A ATTGC G ACTC ATTT AG A 

TAGTAAAATCAATAGTCGAGATAGTCAAAAATTGGTTATCTATAACTGGGGAGACTATATCGATCCTGAACTCTTG 

ACTCAGTTTACAGAAGAAACAGGAATTCAAGTTCAGTACGAGACTTTTGACTCCAACGAAGCCATGTACACTAAGA 

TAAAGCAGGGTGGAACGACCTACGATATTGCCATTCCAAGTGAATACATGATTAACAAGATGAAGGACGAAGACC 

TCTTGGTTCCGCTTGATTATTCAAAAATTGAAGGAATCGAAAATATCGGACCAGAGTTTCTCAACCAGTCCTTTGAC 

CCAGGTAATAAATTCTCCATCCCTTACTTCTGGGGAACCTTAGGAATTGTCTACAACGAAACCATGGTAGATGAAG 

CGCCTGAGCATTGGGATGACCTTTGGAAGCCGGAGTATAAGAATTCTATCATGCTCTTTGATGGGGCGCGTGAGGT 

GCTGGGACTAGGACTCAATTCCCTCGGCTACAGCCTCAACTCCAAGGATCTGCAGCAGTTGGAAGAGACAGTGGAT 

AAGCTCTACAAACTGACTCCAAATATCAAGGCTATCGTTGCGGACGAGATGAAGGGCTATATGATTCAGAATAATG 

TTGCAATCGGCGTGACCTTCTCTGGTGAAGCCAGCCAAATGTTAGAAAAAAATGAAAATCTACGTTATGTGGTACC 

GACAGAGGCCAGCAATCITTGGTTTGACAATATGGTCATTCCCAAAACAGTTAAAAACCAAAACTCAGCCTATGCC 

TTTATCAACTTTATGTTGAAACCTGAAAATGCTCTCCAAAATGCGGAGTATGTCGGCTATTCAACACCAAACCTACC 

AGCGAAGGAATTGCTCCCAGAGGAAACAAAGGAAGATAAGGCCTTCTATCCCGATGTTGAAACCATGAAACACCT 

AGAAGTTTATGAGAAATTTGACCATAAATGGACAGGGAAATATAGCGACCTCTTCCTACAGTTTAAAATGTATCGG 

AAGTAG 

(SEQ [D NO: 102) 

MKKIYSFLAGIAAIILVLWGIATHLDSKINSRDSQKLVIYNWGDYrDPELLTQFTEETG(QVQYETFDSNEAMYTKIKQGG 

TTYDIAIPSEYMINKMKDEDLLVPLDYSKIEGIENIGPEFLNQSFDPGNKFSIPYFWGTLGIVYNETMVDEAPEHWDDLW 

KPEYKJ^SIMLFDGAREVLGLGLNSLGYSLNSKDLQQLEETVDKLYKJ^TPNIKAIVADEMKGYMIQNNVAIGVTFSGEAS 

QMLEKNENLRYVVFTEASNLWFDNMVIPKTVKi^QNSAYAFINFMLKPENALQNAEYVGYSTPNLPAKELLPEETK£D^ 

AFYPDVETMKHLEVYEKFDHKWTGKYSDLFLQFKMYRK 

ID61 1851bp 

(SEQ ID NO: 103) 

ATGAATAAAAAACTAACAGATTATGTGATTGATCTGGTGGAAATTTTAAATAAACAACAAAAGCAGGTTTTCTGGG 
GAATATTTGATATTTTCAGTATGGTGGTTTCCATCATTGTATCTTAT^ 

CTGTTGACTACATTATCTATACGAGTTTGGCCTTCCTGTTCTATCAATTGATGATTGGTTTT^ 

GC ATT A GTCGTT AC A GC A AG ATT ACGG ATTTC ATG A A A ATCTTTTTTGGTGTG ACTGCT AGC AGTGTCTTGTC AT AT 
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agtatctgttatc^cttcttgccactcttctccatc 

tattgccacggattacttggcagttaatctactccagacgcaaaaaaggtagtggtgatggagaacaccgtcggac 

cttcttgattggtgccggtgatggtggggctctttttatggatagttaccaacatccaaccagtgaattagaactg 

tcggtattttggataaggattctaagaaaaagggtcaaaaacttggtckjtattcct 

ctgcctgaattagccaaacck:catcaaatcgagcgtgtcatcgttgcgattccgtcgctggatccgtcagaatatg 
agcgtatcttgcagatgtgtaatauagctgggtgtcaaatgttacaagatgcctaaggttgaaactgttgttcaggg 

CCTTCACCAAGCAGGTACTGCK:TTCCAAAAAATTGATATTACGGACCTTTTCKjGTCGTCACKjAAATC 

gaatcgcgtctgggtgcagaactgacaggtaagaccatcttagtcacack3acct 
tctgtcgtcaagttagtcgcttcaatcctgaacgcattgtcttgctc 

tatcatgaattgattcgtaagttccaagggattgattatgtacctgtgattgcggacattcaagactatgatcgttt 
gttgcaagtctitgagcagtacaaacctgctattgtttatc 

gcaatccaaaagaagccttcaaaaacaatatccgtggaachtacaatgttgctaaggctgttgatgaagctaaagt 
gtctaagatggttatgatttcgacagataaggcagtcaatccaccaaatgttatgggagcaaccaagcgcgtggcg 
gagttgattgtcactgck:tttaaccaacgtagccaatcaacctactc^ 

ccgtggtagtgtcattccagtctttgaacgtcagattgctgaaggtgggcctgtaacggtgacagacttccgtatg 

acccgttactttatgaccattccagaach:tack:cgtctggttatccatgctggtgcttatgccaaagatggggaagt 

ctttatccttgatatgggcaaaccagtcaagatttatgacttggccaagaagatggtgcttctaagtggccacact 

gaaagtgaaattccaatcgttgaagttggaatccgcccaggtgaaaaactctacgaagaactcttggtatcaaccg 

aactcgttgataatcaagttatggataagattttcgttggtaaggttaatgtcatgcctttagaatccatcaatcaa 

aagattggagagttccgcactctcagtggagatgagttgaagcaagctattatcgcctttgctaatcaaacaaccc 

acattgaataa 

(SEQ ID NO: 104) 

MNKK1.TDYVIDLVEILNKQQKQVFWGLFDIFSMVVSIIVSYILFYGLINPAPVDY1IYTSLAFLFYQLMIGFWGLNASISRYS 

KrTDFMKIFFGVTASSVI^YSICYAI^PLFSIRniLFlLl^TFLILLPRrrWQLIYSRRKKGSGDGEHRRTFLIGAGDGGALFM 

DSYQHPTSEl^LVGILDKDSFCKKGQKLGGIPVLGSYDNLPELAKRHQIERVfVAIPSLDPSEYERILQMCNKLGVKCYICM 

PKVETVVQGLHQAGTGFQKIDITDLLGRQEIRLDESRLGAELTGKTILVTGAGGSIGSEICRQVSRFNPERIVLLGHGENSI 

YLVYHELIRKFQGIDYVPVIADiQDYDRU.QVFEQYKPAIVYHAAAHKJHVPMMERNPK£AFKNNlRGTYNVAKAVDEA 

KVSKMVMISTDKAVNPPNVMGATKRVAEUVTGF^QRSQSTYCAVRFGNVLX3SRGSV1PVFERQIAEGGPVTVTDF^ 

RYFMTIPEASRLVIHAGAYAKDGEVFILDMGKPVKIYDLAKKMVLLSGHTESEIPIVEVGIRPGEKLYEELLVSTELVDNQ 

VMDKIFVGKVNVMPLESINQKIGEFRTLSGDELKQAIIAFANQTTHIE 

ID101 1338bp 

(SEQ ID NO: 105) 

atgattgaactttatgatagttacagtcaagaaagtcgagatttacatgaaagtctagtcgctactckjtctttctca 
acttggagtggtcatcgatgcagatggttitctck:ctgatcgtc^ 

ggatggaaaacctctctattttaatcaagttcccgtttcagatttttgggaaattttaggagataatcag 

gtattgaagatgtgacgcaggagagggctgtcattcattatgctgatggaatgcaggctcgcttggttaaacaggt 

agactggaaagacctagaaggtcgagtacgtcaggttgaccactacaatcgcttcggagcttgttttgctacaacg 

acttatagcgcagatagcgagccgattatgacagtttaccaagatgtcaatggtcaacaagttttactggaaaacc 

atgtgacgggtgatatcttattgactttgccaggtcagtccatgcgttactnttgcaaataaagttgaatttatcacc 

ttctttttgcaagatttggaaatagataccagtcagcttatctttaatactctagcgactcctttct^ 

c atc atcc ag at a a atctggctcgg atgtcttggt atggc agg a acctctct atg atgcc attcc aggt a at atgc a 

gttgattttggaaagtgataatgtgcgtactaagaagatcatcattccaaataagck:gacttatgagcgcck:ttta 

gagttaactgacgagaaataccatgatcagtttgtgcacttgggttatcattaccagttcaaacgtgataatttcct 

aagacgagatgccttaatcttgaccaattcagatcagattgagcaagtagaagcaatcgcaggagccttgcctgat 

gtcactttccgtattgcagcggtgacagagatgtcttctaagctcttagacatgctttgctatcctaatgtggccct 

ttaccagaacgctagtccacagaagattcaggagctgtatcaactgtcggatatttacttggatataaaccacagt 

aatgagttgctacaggcagtgcgtcaggcctttgagcacaatctcttgattcttggctttaatcagacggtgcaca 

atagactttatatcgctccagaccatctatttgaaagtagtgaagttgctcknttggttgagaccattaaattggcc 

ctttcagatgttgatcaaatgcgtcaggcacttggcaaacaaggccaacatgcaaattatgttgacttggtgagat 

atcaggaaaccatgcaaactgttttaggaggctaa 

(SEQ ID NO: 106) 

M1ELYDSYSQESRDLHESLVATGLSQLGVVIDADGFLPDGLLSPFTYYLGYEDGKPLYFNQVPVSDFWEILGDNQSACIE 

DVTQERAVIHYADGMQARLVKQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVLLENHVT 

GDILLTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATPFLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILESDN 

VRTKKUIPNKATYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEAIAGALPDVTFRIAAVTEMS 

SKLLDMLCYPNVALYQNASPQKIQELYQLSDIYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFESSEV 

AALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGG 

ID102 1512bp 



(SEQ ID NO: 107) 
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ATGACAATTTACAATATAAATTTAGGAATTGGTTGGGCTAGTAGCGGTGTTGAATACGCTCAAGCCTATCGTGCTG 

GTGTTTTTCGGAAATTAAATCTGTCCTCTAAGTTTATCTTTACAGATATGATTTTAGCCGATAATATT 

C AGCC A AT ATTGGTTTTG ATG AT A ATC AGGTT ATCTGGCTTT AT A ATC ATTTC AC AG AT ATC A A A ATTGC ACCT ACT 

ACKTGTGACAGTCKjATGATGTCTTGGCTTACTTrGGTGGTC 

GTGTATTCTTTTTTGACCAAGATAAGTTTGTAACCTGTTATTTGGTTGATGAGAACAAGGACTTGGTTCAA 

GAGTATGTTTTTAACKjGAAACCTGATTCGGAAGGATTACTTTTCITATACGCGTTATTGTAGCGAG 

CAAGGACAATGTTGCAGTCTTATACCAACGAACTTTTTATAATGAAGACGGGACTCCAGTCTATGATATCTTGATG 

AATCAAGGGAAGGAAGAAGTTTATCATTTCAAGGATAAGATTTTCTATGGAAAGCAAGCTTTTGTGCGTGCCTTTA 

TGAAATCTTTGAATTTGAATAAGTCTGATTTGGTCATTCTCGATAGGGAGACAGGTATTGGACAGGTTGTGTTT 

GAAGCACAGACAGCACATCTAGCGGTAGTTGTTCATGCGGAGCATTATAGTGAAAATGCTACAAATGAGGACTAT 

ATCCTTTGGAATAACTATTATGACTATCAGTTTACCAATGCAGATAAGGTTGACTTCTTTATC 

ACAAAATGAAGTTCTACAAGAGCAATTTGCCAAATATACTCAGCATCAGCCAAAGATTGTTACCATTCCTGTAGGC 

AGTATTGATTCCTTGACAGATTCAAGTCAAGGGCGCAAACCATTTTCATTGATTACGGCTTCACGTCTTGCCAAAGA 

A A AGC AC ATTG ATTGGCTTGTG A A AGCTGTG ATTG A AGCTC AT A AGG AGTT ACCGG A ACT A ACCTTTG AT ATCT AT 

GGTAGTGGTGGAGAAGATTCTCTGCTTAGAGAAATTATTC^AAATCATCAGGCAGAGGACTATATCCAACTCAAGG 

GGCATGCGGAACTTTCGCAGATTTATAGCCAGTATGAGGTCTACTTAACGCK:TTCTACCAGCGAAGGATTTGGTCT 

GACCTTGATGGAAGCTATTGGTTCAGGTCTACCTCTAATTGGTTTTGATGTCTCCTTATGGTAATCAGACCTTTATAG 

AGGATGGGCAAAATGGTTATTTGATTCCAAGTTCATCTGACCATGTAGAAGACCAAATCAAGCAAGCTTATGCCGC 

TAAGATTTGTCAATTGTATCAAGAAAATCGTTTGGAAGCTATGCGTCK:CTATTCTTACCAAATTGCAGAAGC^TTCT 

TGACCAAAGAAATTTTAGAAAAGTGGAAGAAAACAGTAGAGGAGGTGCTCCATGATTGA 

(SEQ [D NO: 108) 

MTIYNINLGIGWASSGVEYAQAYRAGVFRKLNI^SKFIFTDMILADNIQHLTANIGFDDNQVIWLYNHFTDIKIAFTSVTV 

DDVLAYFGGEESHREKNGKVLRVFFFDQDKF=VTCYLVDENKX)LVQHAEYVFKGNLIRKDYFSYTRYCSEYFAPKDNVA 

VLYQRTFYNEDGTPVYDILMNQGKEEVYHFKDKIFYGKQAFVRAFMKSLNLNKSDLVILDRETGIGQVVFEEAQTAHL 

AVVVHAEHYSENATNEDYILWNNYYDYQFTNADKVDFFIVSTDRQNEVLQEQFAKYTQHQPKIVTIPVGSIDSLTDSSQ 

GRKPFSLITASRLAKEKHIDWLVKAVIEAHKELPELTFDIYGSGGEDSLLREIIANHQAEDYIQLKGHAELSQrYSQYEVYL 

TASTSEGFGLTLMEAIGSGLPLIGFDVPYGNQTFIEDGQNGYLIPSSSDHVEDQFKQAYAAKICQLYQENRLEAMRAYSY 

QIAEGFLTKEILEKWKKTVEEVLHD 



ID103 2292bp 

(SEQ ID NO: 109) 

atgtcctctctttcggatcaagaattagtack;taaaacagtagagtttcgtcagcgtctttccgagggagaaagtc 

tagacgatattttggttgaagcttttgctgtggtgcgtgaagcagataagcggattttagggatgtttccttatgat 

gttcaagtcatgckjagctattgtcatgcactatggaaatgttgctgagatgaatacgggggaaggtaagaccttga 

cagctaccatgcctgtctatttgaacgcrit'l'icaggagaaggagtgatggttgtgactcctaatgagtatttatca 

aagcgtgatgccgaggaaatgggtcaagtttatcgttttctaggattgaccattggtgtaccatttacggaagatc 

caaagaaggagatgaaagctgaagaaaagaagcttatctatgcttcggatatcatctacacaaccaatagtaatt 

taggttttgattatctaaatgataacctagcctcgaatgaagaaggtaagtttttacgaccgtttaactatgtgatt 

attgatgaaattgatgatatcttgcttgatagtgcacaaactcctctgatt attgc gggttctcctcgtgttcagtct 

aattactatgcgatcattgatacacttgtaacaaccttggtcgaaggagaggattatatctttaaagaggagaaag 

aggaggtttggctcactactaagggggccaagtctgctgagaatttcctagggattgataatttatacaaggaaga 

gc atgcgtcttttgctcgtc atttggttt atgcg attcg agctc at a agctcttt act a a ag at a agg act at atc a 

ttcgtggaaatgagatggtactggttgataagggaacagggcgtctaatggaaatgactaaacttcaaggaggtc 

tccatcack3ctattgaagccaaggaacatgtcaaattatctcctgagacgcgggctatggcctcgatcacctatca 

gagtctttttaagatgtttaataagatatctggtatgacagggacaggtaaggtcgcggaaaaagagtttattgaa 

acttacaatatgtctgtagtacgcattccaaccaatcgtccgagacaacggattgactatccagataatctatatat 

cactttacctgaaaaagtgtatgcatccttggagtacatcaagcaataccatgctaagggaaatcctttactcgttt 

ttgtaggctcagttgaaatgtctcaactctattcgtctctcttgtttcgtgaagggattgcccataatgtcctaaat 

gctaataatgcggcgcgtgaggctcagattatctccgagtcaggtcagatgggggctgtgacagtggctacctcta 

tggcaggacgtggtacggatatcaagcttggtaaaggagtcck:agack:ttgggggcttgattgttattgggactga 

gcggatggaaagtcagcggatcgacctacaaattcgtggccgttctggtcgtcagggagatcctggtatgagtaaa 

ttttttgtatccttagaggatgatgttatcaagaaatttggtccatcttgggtgcataaaaagtacaaagactatca 

ggttcaagatatgactcaaccggaagtattgaaaggtcgtaaataccggaaactagtcgaaaaggctcagcatgc 

cagtgatagtgctggacgttcagcacgtcgtcagactctggagtatgctgaaagtatgaatatacaacgggatata 

gtctataaagagagaaatcgtctaatagatggttctcgtgacttagaggatgttgttgtggatatcattgagagat 

at ac ag a ag aggt agcggctg atc act atgct agtcgtg a att attgtttc acttt attgtg acc a at att a gtttt 

catgttaaagaggttccagattatatagatgtaactgacaaaactgcagttcgtagctttatgaagcaggtgattg 

ataaagaactttctgaaaagaaagaattacttaatcaacatgacttatatgaacagtttttacgactttcactgctt 

a a agcc attg atg ac a actggg tag agc aggt ag act atct ac a ac agct atcc atggct atcggtggtc a atctg 

ctagtcagaaaaatccaatcgtagagtactatcaagaagcctacgcgggctttgaagctatgaaagaacagattc 

atgcggatatggtgcgtaatctcctgatggggctggttgaggtcactccaaaaggtgaaatcgtgactcattttcc 

ATA A 
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(SEQIDNO: 110) 

MSSLSDQELVAKTVEFRQRLSEGESLDDlLVEAFAVVREADKRILGMFPYDVQVMGAfVMHYGNVAEMNTGEGKTLTA 
TMPVYU^AFSGEGVMVVTPNEYIJJKRDAEEMGQVYRFLGLT 

LNDNLASNEEGKJ^RPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTXV^^ 
5 KSAENFT-GrDNLYKEEHASFARHLVYAIRAHKI-FTKDKJ^YIKGNEMVLVDKGTGRLMEMTKlXJG 

LSPETRAMASrrYQSLFKMFNKISGMTGTGKVAEK^FIETYNMSVVRlPTNRPRQRIDYPDNLYITLPEKVYASLEYIKQY 
HAKGNPLLVFVGSVEMSQLYSSLLFREG1AHNVLNANNAAREAQUSESGQMGAVTVATSMAGRGTDIKLGKGVAELG 
GLIVIGTERMESQRIDLQmGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKLV 
EKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDItERYTEEVAADHYASRELLFHFIVTNIS 
10 mVKEVPDYIDVTDKTAVRSFMKQVrDK£l^EKK£LLNQHDLYEQFLRI^I^ 
SQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFP 

ID104 879bp 

15 (SEQIDNO: 111) 

ATGAAACAAGAATGGTTTGAAAGTAATGATTTTGTAAAAACAACAAGCAAGAACAAGCCTGAAGAGCAAGCTCAA 
GAGGTTGCAGACAAGGCTGAAGAAAGGATACCCGATCTCGATACACCAATTGAAAAAAATACTCAGTTAGAGGAG 
GAAGTCTCTCAAGCTGAAGTCGAATTGGAAAGCCAGCAAGAAGAGAAAATTGAAGCTCCTGAAGACAGTGAACCG 
AGAACAGAAATAGAAGAAAAGAAGGCATCTAATTCTACTGAAGAAGAGCCAGACCTTTCTAAAGAAACAGAAAA 
20 AGTCACTATACKrTGAAGAGACK:CAAGAAGCTCTTCCTCAGCAAAAAGCAACCACGAAAGAGCCACTTCTTATCAG 
TAAATCTTTAGAAAGTCCTTATATCCCCGACCAAGCTCCAAAATCTAGGGATAAATGGAAAGAGCAAGTGCTTGAT 
TTTTGGTCTTGGCTAGTGGAAGCGATCAAATCTCCTACAAGTAAGTTGGAAACAAGTATCACACACAGTTACACAG 
CCTTTCTCTTGCTCATTCTGTTTTCTGCATCTTCCTTTTTCTTTAGTATCT^ 

TATAGC A AGCATTA AC AGTCGCTTCCCTG AGC AGCTAGCTCCTTTA ACTC 111111 CT ATC ATCTCTATCCT AGTAGC 
25 GACAACACTCTTCTTCTTTTCATTCCTCTTC 

CGCTAGACAAGGTTCTCCAACAATATAGTCAACTCTTGGCAATTCCAATCn'CCTCACTGCTATTGCTAGTTTCTTTG 
CTTTCTTTGATAGCCTACGATTTACAGCCCTCTTGTGTGTGA 

(SEQ ED NO: 112) 

30 MKQEWFESNDFVKTTSKNKPEEQAQEVADKAEERIPDLDTPIEKNTQLEEEVSQAEVELESQQEEKIEAPEDSEARTEIEE 
KKASNSTEEEPDLSK£TEKVT1AEESQEALPQQKATTKEPLLISKSLESPYIPDQAPKSRDKWKEQVLDFWSWLVEAIKSP 
TSKLETSITHSYTAFLLLILFSASSFFFSIYHIKHAYYGHIASINSRFPEQLAPLTLFSIISILVATTLFFFSFLLGSFVVRRFIHQ 
EKDWTLDKVLQQYSQLLAIPISSLLLLVSLLSLIAYDLQPSCV 

35 ID106 327bp 

(SEQIDNO: 113) 

atgtactttccaacatcctctgccttgattgaatttctcatcttggctgtactggagcagggtgattcttatggttat 
gagattagccaaaccattaagctgatcgctaatatcaaagaatccacactctatcccattctcaaaaaattggaag 
40 ck:aatagctttctgacaacctattctagagagttccaaggtcgcatgcgcaaatactactccttgacaaacggtgg 
tatagagcagctcttgaccctaaaagatgaatgggcactctatacagacaccatcaatggcatcatagaagggag 
tatccgccatgacaagaactga 

(SEQ ID NO: 114) 

45 MYFPTSSALIEFLILAVLEQGDSYGYEISQTIKI-IANIK£STLYPILKKI^GNSFLTTYSREFQGRMRKYYSLTNGGIEQLLT 
LKDEWALYTDT ING ILEGSIRHDKN 

ID108 954bp 

50 (SEQIDNO: 115) 

ATGGATTTTGAAAAAATTGAACAAGCTTATATCTATTTACTAGAGAATGTCCAAGTCATCCAAAGTGATTTGGCGA 
CCAACTTTTATGACGCCTTGGTGGAGCAAAATAGCATCTATCTGGATGGTGAAACTGAGCTAAACCAGGTCAAAGA 
CAACAATCAGGCCCTTAAGCGTTTAGCACTACGCAAAGAAGAATGGCTCAAGACCTACCAGTTTCTCTTGATGAAG 
GCTGGGCAAACAGAACCCTTGCAGGCCAATCACCAGTTTACACCGGATGCTATTGCTTTGCTTTTGGTGTTTATT 

55 GGAAGAGTTGTTTAAAGAGGAGGAAATTACTATCCTCGAAATGGGTTCTGGGATGGGAATTCTAGGCGCTATTTTC 
TTG ACCTC GCTT ACT A A A A AGG TGG ATT ACTTGGG A ATGG A AGTGG ATG ATTTGCTG ATTG ATCTGGC AGCT AGC A 
TGGCAGATGTAATnTGGTTTGCAGGCTGGCTTTGTCCAAGGAGATGCCGTTCGCCCACAAATGCTCAAAGAAAGCGA 
TGTGGTCATCAGTGACTTGCCTGTCGGCTATTATCCTGATGATGCCGTTGCGTCGCGCCATCAAGTTGCTTCTAGCC 
AAGAACATACTTACGCCCATCACTTGCTCATGGAACAAGGGCTTAAGTACCTCAAGTCAGACGGATACGCTATTTT 

60 TCTAGCTCCGAGTGATTTGTTGACCAGTCCTCAAAGTGATTTGTTAAAAGAATGGCTGAAAGAAGAGGCGAGTCTG 
GTTCK^TATGATTAGTCTGCCTGAAAATCTCTTTGCTAATGCCAAACAATCTAAGACTATTTTTATCTTACAGAAGAA 
AAATGAAATAGCAGTAGAGCCTTTTGTTTATCCACTTGCTAGCTTG^ 
AAAATTTTCAAAAATGGACTCAAGGTACTGAAATATAA 

65 (SEQIDNO: 116) 



56 



MDFEKIEQAYIYLLENVQVIQSDLATNFYDALVEQNSIYL^ KAG 
QTEPLQANHQFTPDAlALLLVFIVEELR<£EErril^MGSGMGILGAIFLTSLTKKVDYLGMEVDDLLIDLAASMADVIGL 
QAGFVQGDAVRPQMLKESDVVISDLPVGYYPDDAVASRHQVASSQEHTYAHHLLMEQGLKYLKSDGYAIFLAPSDLLT 
SPQSDIXKEWLK£EASLVAMISLPENLFANAKQSKTIF1LQKKNEIAV^^ 

IDn01902bp 

(SEQ ID N O: 117 ) 

atgattattttacaagctaataaaattgaacgttcttttc 
tgatgaacgagatcggattgctcttgttgcgaaaaatggtgcacot^ 

gaagaggagccaactagcggagaaatcaataagaaaaaagatatttctctgtcttacctagcccaagatagccgt 
tttgagtctgaaaataccatctacgatgaaatckrrtcatgtcittaatgatttgcgtcckja 
gtcaga tgga gctggagatgggtgaaaagtctggtgaggatttggataaactgatgtcagattatgaccgcttatc 
tgagaattttcgccaagcaggtcksctttacctatoaacktgat^ 

agtctatgtggcagatgaaaattgctgack:tttctggtggtcaaaatactcgtttggcacttgccaaaa 

gaaaagcccaatctcttggtcttggacgagccaactaaccacttggatattgaaaccatcgcctggctagagaatt 

acttggtaaactatagcggtgccctcattatcgtcagccacgaccgttatttcttggacaaggttgcgacaattacg 

ctagatttgaccaagcattccttggatcgctatgtggggaattactctcgttttgtcgaattgaaggagcaaaac^ 

agttactgaggcaaaaaactatgaaaagcaacagaaggaaatcgctgctctggaagactttgtcaatcgcaatct 

agttcgtgcttcaacgactaaacgtckrtcaatctcgccgtaaacaactagaaaaaatggagcgtttggacaagcct 

gaagctggcaagaaagcagccaacatgaccttccagtctgaaaaaacgtcgggcaatgttgttttgactgttgaaa 

atgcagctgttggctatgacggggaagtcttgtcacaacctatcaacctagatcttcgtaagatgaatgctgtcgc 

tatcgttggtccaaatggtatcggcaagtcaacctttatcaagtctattgtggaccagattccttttatcaagggag 

aaaack:gctttgc^ch:taatgttgaggttggttactatgaccaaacccaaagcaagctgacaccaagtaatacggt 

gctgg atg a actctgg a atg atttc a a actg ac acc ag a a gttg a a atccgc a accgtcttgg agccttccttttct 

caggagatgatgttaaaaaatcagtcggcatgctatctggtggcgaaaaagctcgtttgcttttagctaaattgtc 

tatggaaaacaataactttttgattctggatgagccgaccaaccacttggatattgatagtaaggaagtgctagaa 

a atgc cttgattgactttgatggaaccttgctgtttgtcagtcatgatcgttactttatcaatcgtgtggcaactcat 

gttttggaattgtctgagaatggttcaactctctaccttggagattacgactactatgttgagaagaaagcaacag 

cagaaatgagtcagactgaggaagcttcaactagcaatcaagcaaaggaagcaagtccagtcaatgactatcagg 

cccagaaagaaagtcaaaaagaagttcgcaaactcatgcgacaaatcgaaagtctagaagctgaaattgaagagc 

tagaaagtcaaagccaagccatttctgaacaaatgttggaaacaaacgatgccgacaaactcatggaattacagg 

ctgagctggacaaaatcagccatcgtcaggaagaagctatgcttgagtgggaagaattatcagagcaggtgtaa 

(SEQ ID NO: 118) 

MIILQANKIERSFAGEVLFDNINLQVDERDREALVGKNGAGKSTLLKILVGEEEPTSGEINKKKDISLSYLAQDSRFESENT 
IYDEMLHVFNDLRRTERQLRQMEL£MGEKSGEDLDKLMSDYDR1^ENFRQAGGFTYEADIRAILNGFKFDESMWQMKI 
AELSGGQNTRLALAKMLLEKPNLLVLDEPTNHLDIETIAWLENYLVNYSGALIIVSHDRYFLDKVATITLDLTKHSLDRY 
VGNYSRFVELK£QIO.VTEAK^YEKQQK£IAAL^DFVNRNLVRASTTKRAQSRRKQ 

SEKTSGNVVLTVENAAVGYDGEVLSQPINLDLRKMNAVAIVGPNGIGKSTFIKS1VDQIPFIKGEKRFGANVEVGYYDQT 
QSKI.TPSNWLDELWNDFKJ.TPEVEKNRLGAFLFSGDDVKKSVGMI^GGEKARLLLAKl^MENNNFLILDEPTNHLDID 
SKEVLENALIDFDGTLLFVSHDRYFINRVATHVLELSENGSTLYLGDYDYYVEKKATAEMSQTEEASTSNQAKEASPVN 
DYQAQK£SQK£VRKLMRQIESLEAEIEELESQSQAISEQMLETNDADKLMELQAELDKISHRQEEAMLEWEELSEQV 

ID111 1179bp 

(SEQ LD NO: 119) 

atg a atcgct atgc agtgc agttg att a gccgtggggct atc a at a a a atggg a a at atgctct atg att atgg a a 

atagtgtctggttggcttctatggggactataggacagacagttttaggaatgtatcagatttctgagctcgtcaca 

tctattctcgtcaatccctttggcggagttatttcagaccgrririctcgtcgtaagattttaatgacggcagatc^ 

gtttgtggg attctttgtctggc t atttctttc at a a gg a atg at agctgg atg attggcgctttg attgttgct a ac 

attgtgcaggctattgcttttgccttttctcgcacacccaataaagctatcataactgaagtggtggagaaagatg 

agattgtgatctataattctcgcttagagctggttttgcaggttgtaggtgttagctctcctgttctttccttcct 

tttt ac agtttgc a agtctcc at atg acgct actgct ag actcgctg ac 111111 c attgcttttgttctagtggctt 

tccttccaaaagaggaack:aaaagttcaagagaaaaagck:ttttactgggagagatatttttgtagatat^ 

atgggttacactatatctggcatcagcaagaaattttcttccttttgctggtagcttccagcgttaat^ 11 1 g 

cagcttttgaatttctacttcccttttcga 

ck3ggctattggttccatcattggggctcitctacct 

gctttg ac aggtgtcgg agttttt atg atggg att acc acttcc a acttttctttccttttctgg a a attt agtttgt 
gaattgtttatgacgatttttaatattcacttttttact^ 

agtactgagtacaatttttaccttagctattctatttatgcctattgcaaaaggatttatgacagtct^ 

tccatctttattctttcttgattattggacttggagttgtagccttatatttcttagctctcggatatgttcgaac 

attttgaaaaattgatataa 



(SEQ ID NO: 120) 
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MNRYAVQLISRGAINKMGNMLYDYGNSVWLASMGTIGQTVLGMYQISELVTSILVNPFGGV1SDRFSRRKILMTADLVC 

GILCI^ISFIRNDSWMIGALrVANIVQAlAFAFSRTANKAIITEVVEKDEIVIYNSRLELVLQVVGVSSPVLSFLVLQFASLH 

MTIJXDSLTFFIAFVLVAFLPKEEAKVQEKKAFTGRDIFVDIKDGLHYIW^ 

YGSEGAYASILTMGAIGSIIGALLASKIKANIYNIXILLALTGVGVFMMGIJ^ 

KVESEFLGRVI^TIFTI^ILFMPIAKGFMTVLPSVHLYSFLIIGLGVVALYFIJVLGYVRTHFEKLI 

IDU3 2466bp 

(SEQ ID NO: 121) 

ATGCAAAATCAATTAAATCAATTAAAACGAAAAATGCTra 

GCTAGACCTGGCAAGAAAGGTTCAAGTACCAAAAAATCTAAAACCrTAGATAAGTCAGCCATTTTCCCAGCTATTT 
T ACTG AGT AT A A A AGCCTT A TTT A ACTT ACTCTTTGT ACTCGGTTTTCT AGG AGG A ATGTTGGG AGCTGGG ATTGCT 

TTGGGATACGGAGTGGCCTTATTTGACAAGGTTCGGGTGCCTCAGACAGAAGAATTGGTGAATCAGGTCAAGGAC 

ATCTCTTCTATTTCAGAGATTACCTATTCGGACGGGACGGTGATTGCTTCCATAGAGAGTGATTTGTTGCGCACTTC 

TATCTCATCTGAGCAAATTTCGGAAAATCTGAAGAAGGCTATCATTGCGACAGAAGATGAACACTTTAAAGAACAT 

AAGGGTGTAGTACCCAAGGCGGTGATTCGTGCGACCTTGGGGAAATTTGTAGGTTTGGGTTCCTCTAGTGGGGGTT 

CAACCTTGACCCAGCAACTAATTAAACAGCAGGTGGTTGGGGATGCGCCGACCTTGGCTCGTAAGGCGGCAGAGA 

TTGTGGATGCTCTTGCCTTGGAACGCGCCATGAATAAAGATGAGATTTTAACGACCTATCTCAATGTGGCTCCCrrTT 

GGCCGAAATAATAAGGGACAGAATATTGCAGGGGCTCGGCAAGCAGCTGAGGGAATTTTCGGTGTAGATGCCAGT 

CAGTTGACTGTTCCTCAAGCAGCATTTTTAGCAGGACTTCCACAGAGTCCCATTACTTACTCTCCTTATGAAAATAC 

TGGGGAGTTGAAGAGTGATGAAGACCTAGAAATTGGCTTAAGACGGGCTAAGGCAGTTCTTTACAGTATGTATCGT 

ACAGGTGCATTAAGCAAAGACGAGTATTCTCAGTACAAGGATTATGACCTTAAACAGGACTTTTTACCATCGGGCA 

CGGTTACAGGAATTTCACGAGACTATTTATACTTTACAACTTTGGCAGAAGCTCAAGAACGTATGTATGACTATCTA 

GCTCAGAGAGACAATGTCTCCGCTAAGGAGTTGAAAAATGAGGCAACTCAGAAGTTTTATCGAGATTTGGCAGCC 

AAGGAAATTGAAAATGGTGGTTATAAGATTACTACTACCATAGATCAGAAAATTCATTCTGCCATGCAAAGTGCGG 

TTGCTGATTATGGCTATCTTTTAGACGATGGAACAGGTCGTGTAGAAGTAGGGAATGTCTTGATGGATAACCAAAC 

AGGTGCTATTCTAGGCTTTGTAGGTGGTCGTAATTATCAAGAAAATCAAAATAATCATGCCTTTGATACCAAACGTT 

cck;cagcttctactaccaagcccttck:tggcctacggtattgctattgaccaggck:to 

TCTATCTAACTATCCAACAAACTTTGCTAATGGCAATCCGATTATGTATGCTAATAGCAAGGGAACAGGAATGATG 

ACCTTGGGAGAAGCTCTGAACTATTCATGGAATATCCCTGCTTACTGGACCTATCGTATGCTCCGTGAAAAGGGTG 

TTGATGTCAAGGGTTATATGGAAAAGATGGGTTACGAGATTCCTGAGTACGGTATTGAGAGCTTGCCAATGGGTGG 

TGGTATTGAAGTCACAGTTGCCCAGCATACCAATGGCTATCAGACCTTAGCTAATAATGGAGTTTATCATCAGAAG 

CATGTGATTTCAAAGATTGAAGCAGCAGATGGTAGAGTGGTGTATGAGTATCAGGATAAACCGGTTCAAGTCTATT 

CAAAAGCTACTGCGACGATTATGCAGGGATTGCTACGAGAAGTTCTATCCTCTCGTGTGACAACAACCTTCAAGTC 

TAACCTGACTTCTTTAAATCCTACTCTGGCTAATGCAGATTGGATTGGGAAGACTGGTACAACCAACCAAGACGAA 

AATATGTGGCTCATGCTTTCGACACCTAGATTAACCCTAGGTGGCTGGATTGGGCATGATGATAATCATTCATTGTC 

ACGTAGAGCAGGTTATTCTAATAACTCTAATTACATGGCTCATCTCK3TAAATGCGATTCACK:AAGCTTCCCCAAGC 

ATTTGGGGGAACGAGCGCTTTGCTTTAGATCCTAGTGTAGTGAAATCGGAAGTCTTGAAATCAACAGGTCAAAAAC 

CAGAGAAGGTTTCTGTTGAAGGAAAAGAAGTAGAGGTCACAGGTTCGACTGTTACCAGCTATTGGGCTAATAAGTC 

aggagcck:cagcgacaagttatcck:tttgctattggcggaagtgatck:ggattatcagaatgcttggtctagtatt 

GTGGGGAGTCTACCAACTCCATCCAGCTCCAGCAGTTCAAGTAGTAGTTCTAGCGATAGCAGTAACTCAAGTACTA 
CACGACCTTCTTCTTCAAGGGCGAGACGATAA 

(SEQ ID NO: 122) 

MQNQLNELKRKML£FFQQKQKNKKSARPGKKCSSTKKSKTLDK^^ 

GVALFDKVRVPQTEELVNQVKI^ISSISEITYSDGTVIASIESDLLRTSISSEQISENLKKAIIATEDEHFKEHKGVVPKAVIRA 

TLGKFVGLGSSSGGSTLTQQLIKQQVVGDAPTLAR KAAEIVDALALERAMNKDEILTTYLNVAPFGRNNKGQNIAGARO 

AAEG1FGVDASQLTVPQAAFLAGLPQSPITYSPYENTGELKSDEDLEIGLRRAKAVLYSMYRTGALSKDEYSQYKDYDL 

KQDFLPSGTVTGISRDYLYITTLAEAQERMYDYLAQRDNVSAKELKNEATQKJ^RDLAAICEIENGGYKITTTIDOKIHS 

AMQSAVADYGYLLDDGTGRVEVGNVLMDNQTGAILGFVGGRNYQENQNNHAFDTKRSPASTTKPLLAYGIAIDQGLM 

GSETII^NYPTNFANGNPIMYANSKGTGMMTLGEALNYSWNIPAYWTYRMLREKGVDVKGYMEKMGYEIPEYGIESLP 

MGGGIEVTVAQHTNGYQTLANNGVYHQKHVISKIEAADGRVVYEYQDKPVQVYSKATATIMQGLLREVLSSRVTTTFK 

SNLTSLNPTLANADWIGKTGTTNQDENMWLMl^TPRLTLGGWIGHDDNHSLSRRAGYSNNSNYMAHLVNAIQOASPSI 

WGNERFALDPSVVKSEVLKSTGQKPEKVSVEGKEVEVTGSTVTSYWANKSGAPATSYRFAIGGSDADYONAWSSIVGS 
LPTPSSSSSSSSSSSDSSNSSTTRPSSSRARR 

ID1141974bp 

{SEQ ID NO: 123) 

A I2^ AAAArmATGTAAGTC 

A 1 I 1 I I GTTA AT A ATA ATCTGTTG ACGGTTTTAATTTTGTTTCTTTTTGTAGG AGGCTATGTTTTTTT ATTT A AG A A AC 
TGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGACAG 
CTCTATTGGAAC^ATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCTGGAGAGGTTGAGTGGTTTAATCCCTAT 

ck:tgaattgattttgaccaaggaagatggtgattttgatttagaagctgttcaaacgattatcaagc^^ 

GAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATCCT^ 

-^gtagatgtatccagggaacaagccataacagatgaattggtaacaagtagaccagtgattgggattgtctctg 
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tggataattatgatgatttggaggatgaaacttctgagtcagatattagtcaaatcaatagttttgtagctaatttt 
atatcagagttttcagaaaaacacatgatgttttctcgtcgggtaagtatggatcgattttatctatttactgacta 
cacggtgcttgagggcttgatgaatgataaattttctgttattgatgctttcagagaagagtcgaaacagagacag 
ttgcccttgaccttaagtatgggattttcttatggcgat^ 

tttgaacttggctgaagtacgtggtggcgaccaggtggttgttaaggaaaacgacgaaacgaaaaatccagtttat 
tttggtggtgggtctgctgcttcaatcaack^gtacacggact^ 

agattcggagtgtagatcaggtttttgtagtcggtcacaaaaatttagacatggatgctttgckx:t 
atgcagttgttcgccagcaatgtgattgaaaatagctatgctctttatgatgaagaacaaatgtctccagatattg 
aacgagctgtttcattcatagaaaaagaaggagttacgaagttgttgtctgttaaggatgcaatggggatggtgac 
caatcgttctttgttgattcttgtagaccattcaaagacac^ 

ccaaaccattgttattgaccaccatagaagggatcaggattttccagataatgcggttattacttatatcgaaagt 

ggtgcaagtagtgccagtgagttggtaacggaattgattcagttccagaattctaagaaaaatcgtttgagtcgta 

tgcaagk^aagtgtcnrtgatggctggtatgatgttggatactaaaaatttcacctcgcgagtaactagtcggacatt 

tgatgttgctagctatctcagaacgcgcggaagtgatagtattgctatccaggaaatcgctgcgacagattttgaa 

gaatatcgtgaggtcaatgaacttattttacaggggcgtaaattaggttcagatgtactaatagcagaggctaagg 

acatgaaatgctatgatacagttgttattagtaaggcagcagatgccatgttagccatgtcaggtattgaagcgag 

ttttgttcttgcgaagaatacacaaggatttatctctatctcagctcgaagtcgtagtaaactgaatgtacaacgga 

ttatggaagagttaggcggtggaggccactttaatttgck:agcagctcaaattaaagatgtaaccttgtcagaagc 

aggtgaaaaactgacagaaattgtattaaatgaaatgaaggaaaaggagaaagaagaatga 

(SEQ ID NO: 124) 

MKKFYVSPIFPILVGLIAFGVLSTFIEFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 

MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 

EQArTDELVTSRPVIGIVSVDNYDDI^DETSESDISQINSFVANRSEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 

FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASIKRTR 

TRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTKLL 

SVKDAMGMVTNRSIXILVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVITYIESGASSASELVTEL1QFQNSKK 

NRl^RMQASVUVfAGMMLDTKJNJFTSRVTSRTFDVASYLRTRGSDSIAIQElAATDFEEYREVNELILQGRKLGSDVLIAEA 

KI)MKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGHSISARSRSK1J^VQR 

EKLTEIVLNEMKEKEKEE 

ID115 663bp 

(SEQ ID NO: 125) 

ATGAAdTGCTTGTTATGTGGGCAGACTATGAAGACTGririAACll 11 AGTAGTC TCTTACTTCTGAGGAATGATGA 

CTCTTGTCTTTGTTCAGACTGTGATTCTACTTTTGAAAGAATTGGGGAAGAGAACTGTCCAAATTGTATGAAAACAG 

AGTTGTCAACAAAGTGTCAAGATTGTCAACTTTGGTGTAAAGAGGGAGTTGAAGTCAGTCATAGAGCGATTTTTAC 

TT AC A ATC A AGCT ATG A AGG ATTTTTTC A GTC GGT AT A AGTTTG ATGG AG ACTTCCTGTT A AG A A A A GTTTTCGCTT 

CATTTTTAAGTGAGGAGTTGAAAAAGTACAAAGAGTATCAATTTGTTGTAATTCCCCTAAGTCCTGATAGATATGCT 

AATAGAGGATTTAATCAGGTTGAGGGCTTGGTAGAGGCAGCAGGCTTTGAGTATCTGGATTTATTAGAGAAAAGA 

GAAGAGAGAGCCAGTTCTTCTAAAAATCGTTCAGAGCCKJTTGGGGACAGAACTTCCTTTCTTTATTAAAAGTGGAG 

TCACTATTCCTAAAAAAATCCTACTTATAGATGATATCTATACTACAGGAGCAACTATAAATCGTGTTAAGAAACT 

GTTGGAAGAAGCTGGTGCTAAGGATGTAAAAACATTTTCCCTTGTAAGATGA 

(SEQ ID NO: 126) 

MKCLLCGQTMKTVLTFSSLLLLRNDDSCLCSDCDSTFERIGEENCPNCMKTELSTKCQDCQLWCKEGVEVSHRAIFTYN 
QAMKJ3FFSRYKFDGDFLLRKVFASFLSEELKKYK£YQFVVIPLSPDRYANRGFNQVEGLVEAAGFEYLDLLEKREERAS 
SSKNRSERLGTELPFFIKSGVTIPKKILLIDDIYTTGATINRVKKLLEEAGAKDVKTFSLVR 

ID116 1299bp 

(SEQ ID NO: 127) 

atgaaagtaaatttagattatctcggtcgtttatttactgagaatgaattaacagaagaagaacgtcagttggcgg 
agaaacttccagcaatgagaaaggagaaggggaaacttttctgtcaacgctgtaatagtactattctagaagaat 
ggtatttgcccatcggtc^ttactattgtcgagagtgcttckrrgatgaagcgagtcagaagtgatcaaactttatac 
tattttccgcaggaggattttccaaagcaagatgttctcaaatc^ 

tgtcagagggattgcttcaagtagtagacaagcaaaagccaaccttagttcatgcggtaacaggagctggaaaga 

cagaaatgatttatcaagtagtggctaaagtgatcaatgcgggtggtgcagtgtgtttggctagtcctcgcataga 

tgtttgtttggagctgtacaagcgcctgcaacaggatttttcttgcgggatagctttgctacatggagaatcggaac 

cttattttcgaacaccactagttgttgcaacaacccatcagttattgaagttttatcaagc'rimmgatttgctgatag 

tggatgaagtagatck:ttttccttatgttgataatcccatck:tttaccacgctgtcaagaatagtgtaaaggagaat 

ggattgagaatctttttaacagcgacttcgaccaatgagttagataaaaaggtccgtttaggagaactaaaaagac 

tgaatttaccgagacggtttcatggaaatccgttgattattccaaaa 

ttagacaagaatcgtttgtcaccaaagttaaagtcctatattgagaagcagagaaagacagcttatccgttactca 
tttttgcttcagaaattaagaaaggggagcagttagcagaaatcttacaggagcaatttccaaatgagaaaattgg 
ctttgtatcttctgtaacagaggatcgattagagcaagtacaagcttttcgagatggagaactgacaatacttatc 
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AGTACGACAATCTTGGAGCGCGGAGTTACCTTCCCTTGTGTGGATGTTTTCGTAGTAGAGGCCAATCATCGTTTGTT 
TACCAAGTCTAGTTTGATTCAGATTCKjTGGACGAGTTGGACGA 

TTCCATGATGGGTTAAATGCTTCAATCAAGAAGGCGATTAAGGAAATTCAGATGATGAATAAGGAGGCTGGTCTAT 
GA 

(SEQ ID NO: 128) 

MKVNLDYLGRLFTENELTEEERQLAEKJLPAMRKE 

QEDFPKQDVLKWRGQLTPFQEKVSEGLLQVVDKQKPTLVHAVTGAGKTEMIYQVVAKVINAGGAVCLASPRIDVCLEL 
YKRLQQDFSCGIALLHGESEPYFRTPLVVATTHQLLKJ^QAFDLLIVD 

ATSTNELDKKVRLGELKJ^LNLPRRFHGNPLIIPKPIWI^DFNRYLDKNRl^PKLKSYIEKQRKTAYPLLIFASEIKKGEQU^ 

EILQEQFPNEKIGFVSSVTEDRLEQVQAFRDGELTILISTTIL^RGVTFrcVDVFV^ 

TGDLLFFHDGLNASIKKAIKEIQMMNKEAGL 

ID117 870bo 

(SEQ ID NO: 129) 

atck;aaattcaaaaaagttttaagggck:agtctccctatggcaagctgtatctagtgck:aacgccgattggcaatc 
t ag atg at atg acttttcgtgct atcc ag accttg a a ag a agtgg actgg attgctgctg agg at acgcgc a at ac 
agggcttttgctcaagcattttgacatttccaccaagcagatcagttttcatgagcacaatgccaagga 
cctg atttg attggtttcttg a a agc agggc a a agt attgctc aggtctctg atgccggtttgcct agc atttc ag a 
ccctggtcatgatttagttaaggcagctattgaggaagaaattgcagttgtgacagttccaggtgcctctgcagga 
atttctgccttgattgccagtggtttagcgccacagccacatatcttttacgg it 1 11 i accg aga a a atcaggtc a 
gcagaagcaa'itl i i igc^ttgaaaaaagattatcctgaaacacaga l l l'i ri atgaatcacctcatcgtgtagca 
g ac acgttgg a a a at atgtt a g a agtct acggtg accgctccgttgtcttggtc aggg a attg acc a a aatct atg 
aagaataccaacgaggtactatctctgagttattagaaagcattgctgaaacgccactcaagggcgaatgtcttct 
c attgttg agggtgcc agtc agggtgtgg a gg a a a agg ac g agg a ag acttgttcgt ag a a attc a a acccgc at 
ccagcaaggtgtgaagaaaaaccaagctatcaaggaagtcgctaagatttaccagtggaataaaagtcagctcta 
cgctgcctaccacgactgggaagaaaaacaataa 

(SEQ ID NO: 130) 

MQIQKSFKGQSPYGKXYLVATPIGNLDDMTFRAIQTLK£VDW1AAEDTRNTGLLLKHFDISTKQISFHEHNAKEKIPDLIG 
FLFCAGQS LAQVSD AGLPS ISDPGH DLV KA A IEEEIA V VTVPG AS AGIS ALI ASGLAPQPH IFYGFLPR KSGQQ KQFFGLKKD 
YPETQIFYESPHRVADTLENMLEVYGDRSVVLVRELTKIYEEYQRGTISELLESIAETPLKGECLLIVEGASQGVEEKDEE 
DLFVEIQTR1QQGVKKNQAIKEVAKIYQWNKSQLYAAYHDWEEKQ 

ID118 345bp 

(SEQ ID NO: 131) 

ATGATAAAGAAAGGAAAGGGCTGTTTTATGGACAAAAAAGAATTATTTGACGCGCTGGATGATTTTTCCCAACAAT 
TATTGGTAACCTTAGCCGATGTGGAAGCCATCAAGAAAAATCTCAAGAGCCTGGTAGAGGAAAATACAGCTCTTCG 
CTTGGAAAATAGTAAGTTGCGAGAACGCTTGGGTGAGGTGGAAGCAGATCKrrCCTGTCAACKjCCAAGCATGTTCG 
CG A A AGTGTCCGTCGT ATTTACCGTG ATGG ATTTC ACGT ATGTAATG A 1 rill ATGG AC A ACGTCG AG AGC AGG AC 
GAAGAATGTATGTTTTGTGACGAGTTGTTATACAGGGAGTAA 

(SEQ ID NO: 132) 

MIKKGKGCFMDKKELFDALDDFSQQLLVTLADVEAIKKJNJLKSLVEENTALRl^NSKU^ERLGEVEADAPVKAKHVRES 
VRRIYRDGFHVCNDFYGQRREQDEECMFCDELLYRE 

ID119 639bp 

(SEQ ID NO: 133) 

atgtc a a a agg a 11111 agtctctcttg aggg acc ag aggg agc aggc a ag acc agtgttttag aggctctgctac 

caattttagaggaaaaaggagtagaggtgttgacgacccgtgaacctggcggagtcttgattggggagaagattc 

gggaagtgattttggatccaagtcatactcagatggatgctaaaacagagctacttctctatattgccagtcgcag 

acagcatttggtggaaaaagttcttccagcccttgaagctggcaagttggtcatcatggatcgttttatcgatagtt 

ctgttgcctatcagggatttggtcgtggcttagatattgaagccattgactggctcaatcagtttgcgacagatggc 

ctcaaacccgatttgacactctattttgacatcgaggtggaagaagggctggctcgtattgctgctaatagtgacc 

gcg aggtt a atcgtttgg atttgg a agggttgg acttgc ata aaa a agttcgtc a aggct acctttctcttctgg at 

aaagagggaaatcck:attgtcaagattgatgctagtctccctttggagcaagttgtggaaactaccaaggctgtct 

tgtttg acgg a atgggcttggcc a a atg a 

(SEQ ID NO: 134) 

MSKGFLVSLEGPEGAGKTSVLEALLPILEEKGVEVLTTREPGGVLIGEKIREVILDPSHTQMDAKTELLLYIASRRQHLVE 
KVLPALEAGKLVIMDRFIDSSVAYQGFGRGLDIEAIDWLNQFATDGLKPDLTLYFDIEVEEGLARIAANSDREVNRLDLE 
GLDLHKKVRQGYLSLLDKEGNRIVKIDASLPLEQVVETTKAVLFDGMGLAK 



60 



ID120 408bp 

(SEQ tDNO: 135) 

atggtagaacaaagaaaatcaattaccatgaaagatgttgctttagaagcaggagt^^ 

gtgtaattaataaagaaaaaggcattaaagaagtaactrttgaaaaaagtggaacaagcgattaaaa(trttgaat^ 

acattccagattactacgctagaggaatgaaaaaaaatcgaacagaaacgattck^aatcattgtaccaagtatct 

ggcatcccntctitttcagaatttgctatgcatgtggaaaatgaagtctataagagaaataacaaattac^ 

tctatcaatck}tacaaatagagagcaagactatctggagatgttgcgtcataataaagttgatggagtggttgcca 

ttacctataggccaattgaacattacttgacgtcaggaattccctttgttagtattgaccck:acatactc 

GCCATTCCTTGTGTTTCA 
(SEQ ID NO: 136) 

MVEQRKSITMKDVAI^AGVSVGTVSRVINKEKGIKEVTLKKVEQAIKTLNYIPDYYARGMKKNRTETIAllVPSIWHPFFS 
EFAMHVENEVYKRNNKLLLCSINGTNREQDYLEMLRHNKVDGVVAITYRPIEHYLTSGIPFVSrDRTYSEIAlPCVS 

ID121 285bp 

(SEQ ID NO: 137) 

atgaatatatttagaacaaagaatgttagtttagataaaacagagatgcataggcatttgaagttatgggatttga 

ttttgctgggtatcggagccatggtagggacagck:gtctttacaatcacaggtactck:agctgcaacacttgctgg 

cccagccctagtgatttcaatcgttatttctgccttgtgtgtgggattatcagccctcttttttgca 

ck:gagtacccck:tacaggaggtck:ctatagttacctctatck:tatcttaggagaattccctgcctggttgck:tggtt 

ggttaaccatgatggagttcatgacack:catatcaggcgtagcttcgggttgggcagcttattttaa 

(SEQ ID NO: 138) 

mnifrtkj^vsldktemhrhlki.wdlillgigamvgtgvftitgtaaatlagpalvisivisalcvglsalffaefasrvpa 
tggaysylyailgefpawlagwltmmefmtaisgvasgwaayf 

ID124 131 lbp 

(SEQ ID NO: 139) 

atgaaatcaagagtaaaggaaacgagtatggataaaattgtggttcaaggtggcgataatcgtctggtaggaagc 

gtgacgatcgagggagcaaaaaatgcagtcttacccttgttggcagcgactattctagcaagtgaaggaaagacc 

gtcttgcagaatgttccgattttgtcggatgtctttattatgaatcaggtagttggtggtttgaatgccaaggttga 

ctttgatgaggaagctcatcttgtcaaggtggatgctactggcgacatcactgaggaagccccttacaagtatgtc 

agcaagatgcgcgcctccatcgttgtattagggccaatccttgcccgtgtgggtcatgccaaggtatccatgccag 

gtggttgtacgattggtagccgtcctattgatcttcatttgaaaggtctggaagctatgggggttaagattagtcag 

acagctggttacatcgaagccaaggcagaacgcttgcatggtgctcatatctatatggactttccaagtgttggtg 

caacgcagaacttgatgatggcagcgactctggctgatggggtgacagtgattgagaatgctgcgcgtgagcctg 

agattgttgacttagccattctccttaatgaaatgggagccaaggtcaaaggtgctggtacagagactataaccat 

tactggtgttgagaaacttcatggtacgactcacaatgtagtccaagaccgtatcgaagcaggaacctttatggta 

gctgctgccatgactggtggtgatgtcttgattcgagacgctgtctgggagcacaaccgtcccttgattgccaagtt 

acttgaaatgggtgttgaagtaattgaagaagacgaaggaattcgtgttcgttctcaactagaaaatctaaaagct 

gttcatgtgaaaaccttgccccacccaggatttccaacagatatgcaggctcaatttacagccttgatgacagttg 

caaaaggcgaatcaaccatggtggagacagttttcgaaaatcgtttccaaacctagaagagatgcgccgcatggg 

cttgcattctgagattatccgtgatacagctcgtattgttggtggacagcctttgcagggagcagaagttctttcaa 

ctgaccttcgtgccagtgcggccttgattttgacaggtttggtagcacagggagaaactgtggtcggtaaattggt 

tcacttggatagaggttactacggtttccatgagaagttgck:ck:agctaggtgctaagattcagcggattgaggca 

agtgatgaagatgaataa 

(SEQ ID NO: 140) 

MKSRVK£TSMDKIVVQGGDNRLVGSVTIEGAKJNIAVLPLLAATILASEGKTVLQNVPILSDVFIMNQVVGGLNAKVDFD 

EEAHLVKVDATGDrTEEAPYKYVSKMRASIVVLGPILARVGHAKVSMPGGCTIGSRPIDLHLKGLEAMGVKISQTAGYIE 

AKAERLHGAHIYMDFVSVGATQNUV1MAATLADGVTVIENAAREPEIVDLAILLNEMGAKVKGAGTETITITGVEKLHGT 

THNVVQDRIEAGTFMVAAAMTGGDVLIRDAVWEHNRPLIAKLLEMGVEVIEEDEGIRVRSQLENLKAVHVKTLPHPGF 

PTDMQAQFTALMTVAKGESTMVETVFENRFQHLEEMRRMGLHSEIIRDTARIVGGQPLQGAEVLSTDLRASAALILTGL 

VAQGETVVGKLVHLDRGYYGFHEKLAQLGAKIQRIEASDEDE 

ID125 HOlbp 

(SEQ ID NO: 141) 

ATGTTATTAGCGTCAACAGTAGCCTTGTCATTTGCCCCAGTATTGGCAACTCAAGCAGAAGAAGTTCTTTGGACTGC 
ACGTAGTGTTGAGCAAATCCAAAACGATTTGACTAAAACGGACAACAAAACAAGTTATACCGTACAGTATGGTGA 
T ACTTTG AGC ACC ATTGC AG A AGCCTTGGGTGT AG ATGTC AC AGTGCTTGCG A ATCTG A AC A A A ATC ACT A AT ATG 
GACTTGATTTTCCCAGAAACTGTTTTGACAACGACTGTCAATGAAGCAGAAGAAGTAACAGAAGTTGAAATCCAAA 
CACCTCAAGCAGACTCTAGTGAAGAAGTGACAACTGCGACAGCAGATTTGACCACTAATCAAGTGACCGTTGATG 
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atcaaactgttcaggttgcagacctttctcaaccaattgcagaagttacaaagacagtgattgcttctgaagaagt 

ggcaccatctacgggcacttctgtcccagaggagcaaacgaccgaaacaactcgcccagttgcagaagaagctcc 

tcaggaaacgactccagctgagaagcaggaaacacaaacaagccctcaagctgcatcagcagtggaagcaactac 

aacaagttcagaagcaaaagaactagcatcatcaaatggagctacagcack:agtttctacttatcaaccagaaga 

aacgaaagtaatttcaacaacttacgagcctccack:tg^ 

aatgcaggtcttcaaccacaaacackrrc^crrttaawgaagaaattgctaacttgtt^ 

gttatcgtccaggagacagtggagatcacck3aauaaggtttggctatcgactttatggtaccagaacgttcagaatt 
aggggataagattgcggaatatgctattcaaaatatggccagccgtggcattagttacatcatctggaaacaacgt 
ttctatgctccattcgatagcaaatatgggccagctaacacttggaacccaatgccagaccgtggtagtgtgacag 
aaaatcactatgatcacgttcacgtttcaatgaatggataa 

(SEQ ID NO: 142) 

MLLASTVALSFAPVLATQAEEVLWTARSVEQIQNDLTKTDNKTSYTVQYGDTL^TIAEALGVDVTVIJ^NLNKITNMDLI 
FPETVLTTTVNEAEEVTEVEIQTPQADSSEEVTTATADLTTNQVTVDDQ 

PEEQTTETTRPVAEEAI^ETTPAEKQETQTSPQAASAVEATTTSSEAKEVASSNGATAAVSTYQPEETKVISTTYEAPAAP 
DYAGLAVAKSENAGLQPQTAAFKKKl.LTCl^LHPLVVrVQETVEITEKVWI^TLWYQNVQNZGIRLRNMLFKIWPAVA 
LVTSSGNN VSM LHS I ANM GQLTLGTQCQT V V VZQKITM ITFTFQZM D 

ID126 1281bp 

(SEQ ID NO: 143) 

ttgtttaagaaaaataaagacattcttaatattgcattgccagctatgggtgaaaac1 iii 1 gc ag atgcta atgg 
g a atck3tgg ac agtt atttggttgctc attt agg attg at agct atttc aggggtttc agt ac^ 
accatttatcaggcgattttcatcgctch^ggagctgctatttc 

accagtcgaagttgck^ctatcatgtgactgaggcgttgaagattaccttactattaagtttccttttagga'im'lm'rg 

tccatcttcgctgggaaagagatgataggacttttggggacggagagggatgtagctgagagtggtggactgtat 

ctatctttggtaggcggatcgattgttctcttaggtttaatgactagtctaggagccttgattcgtgcaacgcataa 

tcc acgtctgcctctct atgttag' i " 1 ' i " i " i " i atcc a atgccttg a atattc 1 " i'l " 1"! ' i'c a agtct agct a' i" i " i" i " i gttctg 

g at atgggg at agctggtgttg cttggggg ac a attgtgtctcgtttggttggtcttgtg attttgtggtc ac a att 

aaaactgccttatgggaagccaacttttggtttagataaggaactgttgaccttggctttaccagcagctggagag 

cgacttatgatgagggctggagatgtagtgatcattgccttggtcgtttcttttgggacggaggcagttgctggga 

atgcaatcggagaagtcttgacccagtttaactatatgcctgcctttggcgtcgctacggcaacggtcatgctgttg 

gcccgagcagttggagaggatgattggaaaagagttck:tagtttgagtaaacaaaccttttggctttctctgttcc 

tcatgttgcccctgtccittagtatatatgtcttgggtgtaccattaactcatctctatacgactgattctctagcgg 

tggaggctagtgttctagtgacactgttttcactacttgggacccctatgacgacaggaacagtcatctatacggc 

agtctggcagggattaggaaatgcacgcctccctttttatgcgacaagtataggaatgtggtgtatccgcattggg 

acaggatatctgatggggattgtgcttggttggggcttgcctggtatttgggcagggtctctcttggataatggttt 

tcgctggttatttctacgctatcgttaccagcgctatatgagcttgaaaggatag 

(SEQ ID NO: 144) 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNIITIYQAIFIALGAAISSVISKSICK^KJDQSKLA 
Y H VTE A LK IT LLLS FLLG FLS I FAG KEM IG LLGTER D V AES GG L Y LS L VGG S I V LLG LM TS LG A L I R ATH N PR LPL Y VS FLS 
NALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKI-PYGKPTFGLDKELLTLALPAAGERU^MRAGDVVIIAL 
VVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWLSLFLMLPLSFSIYVLGVPL 
THLYTTDSLAVEASVLVTLFSU-GTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLMGIVLGWGLPGIWA 
GSLLDNGFRWLFLRYRYQRYMSLKG 

1D127 894bp 

(SEQ ID NO: 145) 

gtgggaagaattatcagagcaggtgtaaagatggaacatcttggaaaagtatttcgtgaatttcgaacaagtgga 

aattattctttaaaggaagcagcagck:gaatcctgctctacctctcagttatctcgctttgagcttggggagtctga 

cctgck:agtctcccgtttctttgagattttggataacattcat^ 

attttcataatcatgaacatgtgtctatgatggcacagattatcccactttactattcaaacgatattgcaggtttt 

c a a a agcttc a a ag ag a ac a a cttg a a a agtct a ag agttcg a cg actccccttt attttg agctg a actgg attt 

tgctacaaggtctgatttgtcaaagagatgcgagttatgatatgaagcaggatgatttgggtaaggtagcagatta 

tctcttcaaaacagaagaatggaccatgtatgagttgattcttttcggtaacctctatagtttctacgatgtagact 

atgtcactcggattggtagagaagttatggagagggaggaattttaccaagagattagtccccataagagattag 

tgttg attttggccctc a attgtt acc agc attgttt a g agc attcttc i" i " i'l " i " at a atgcc a act a ' i" i'l " i g aggctt 

atacagagaagattattgacaaaggtattaagctttatgacx:gtaatgtttt^ 

tatcaaaaaggacagtgtaaagaaggctgtaagcagatgcaagaggccatgcatatttttgatgtgttaggtcttc 
cagagcaagtagcctattatcaggaacactacgaaaaatttgtcaaaagttaa 

(SEQ ID NO: 146) 

VGRIIRAGVKA1EHLGKVFKEFRTSGNYSLKEAAGESCSTSQLSRFELGESDLAVSRFFEILDNIHVTIENFMDKARNFHNH 
EHVSMMAQIIPLYYSNDIAGFQKLQREQLEKSKSSTTPLYFELNWILLQGLICQRDASYDMKQDDLGKVADYLFKTEEW 
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TMYELILFGNLYSFYDVDYVTRIGREVMEREEFYQEISRHKRLVLILALNCYQHCLEHSSFYNANYFEAYTEKIIDKGIKL 
YERNVFHYLKGFALYQKGQCKEGCKQMQEAMHIFDVLGLPEQVAYYQEHYEKFVKS 
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TABLE 3 

ID1 J 068 hp 

5 (SEQ ID NO: 147) 

ATGTCTAACATTCAAAACATGTCCCTCXjATO 

aagaccgggctttgccagatattcgtgatgggttgaagccggttcagcgccgtattctttattctatgaataaggat 
agcaatacttttgacaagagctaccgtaagtcggccaagtcagtcgggaacatcatggggaatttccacccacacg 
gggattcttctatctatgatgccatggttcgtatgtcacagaactggaaaaatcgtgagattctagttgaaatgca 

10 cggtaataacggttctatggacggagatcctcctgcggctatgcgttatactgaggcacgtttgtctgaaattgca 
gck:taccttcttcaggatatcgagaaaaagacagttccttttgcatggaactttgacgatacggagaaagaaccaa 
cggtcttgccagcagcctttccaaacctcttggtcaatggttcgactgggatttcggctggttatgccacagacatt 
cctccccataatttagctgaggtcatagatgctgcagtttacatgattgaccacccaactgcaaagattgataaac 
tcatggaattcttgcctggaccagacttccctacaggggctattattcacggtcgtgatgaaatcaagaaagctta 

1 5 - tgagactgggaaagggcgcgtggttgttcgttccaagactgaaattgaaaagctaaaaggtggtaaggaacaaat 
cgttattattgagattccttatgaaatcaataaggccaatctagtcaagaaaatcgatgatgttcgtgttaataac 
aaggtagctgggattgctgaggttcgtgatgagtctgaccgtgatggtcttcgtatcgctatcgaacttaagaaag 
acgctaatactgagcttgttctcaactacttatttaagtacaccgacctacaaatcaactacaactttaatatggtg 
ch:gattgacaatttcacacctcgtcaggttggattgttccaatcctgtctagctatatcgctcaccgtcgagaagtg 

20 a 

(SEQ ID NO: 148) 

MSNIQNMSLEDIMGERFGRYSKYUQDRALPDIRDGLKPVQRRILYSMNICDSNTFDKSYRKSAKSVGNIMGNFHPHGDSS 
lYDAMVRMSQNWKNREILVEMHGNNGSMDGDPPAAMRYTEARI^EIAGYLLQ 
25 AFPNLLVNGSTGISAGYATDIPPHNLA^VIDAAVYMEDHPTAKIDKLMEFLPGPDPTTGAIIQGRDEIKKAYETGKGRVVV 
RSKTEIEK1-KGGK£QIVI1EIPYEINKANLVKKIDDVRVNNKVAGIA£VRDESDRDGLRIAIELKICDANTELVLNYLFKYT 
DLQINYNFNMVAIDNFTPRQVGLFQSCLAISLTVEK 

ID12 684bo 

30 

(SEQ ID NO: 149) 

ATGCCGACATTAGAAATAGCACAAAAAAAACTGGAGTTCATTAAGAAGGCAGAAGAATATTACAATGCCTTGTGT 
ACAAATATACAGTTGAGCGGAGATAAACTAAAAGTAATTTCCGTTACTTCTGTTAACCCTGGGGAAGGAAAAACA 
ACTACTTCCATAAATATAGCATGGTCGTTTGCGCGTGCAGGCTATAAAACTCTTTTGATCGATGGCGATACTCGAAA 

35 TTC AGTT ATGTTAGGAG I" I " I " 1"I AA ATCTCGTG AA AA AATTAC AGGGCTA AC AG A ATTTTTATCTGGG AC AGCTG ATT 

TATCTCACGGTTTATGTGATACAAATATTGAAAATTTATTTGTAGTTCAATCGGGATCTGTATCACCAAACCCTACA 
GCCTTGTT AC A A AGT A A A A ATTTT A ATG AT ATG ATTG A A AC ATTGC GT A A AT ATTTTG ATT AT ATC ATT ATTG AT AC 
ACCGCCT ATTGG A ATTGTT ATTG ATGCGGC A ATT ATC ACTC A A A A GTGTG ATGCGTCC ATCTTGGT A AC AGC A AC A 
GGTGAGGCGAATAAACGTGATATCCAAAAAGCGAAACAACAATTAAAACAAACAGGGAAACTGTTCCTAGGAGTT 

40 GTTTTAAATAAATTGGATATCTCGGTTAATAAGTATGGAGTTTACGGTTCCTATGGAAATTATGGTAAAAAATAA 

(SEQ ID NO: 150) 

MPTLEIAQKJO^FIKKAEEYYNAIXTNIQl^GDKiKVISVTSVNPGEG 

VFKSREKrTGLTEFl^GTADI^HGLCDTNIENLFVVQSGSVSPNPTALLQSKJVFNDMIETLRKYFDYniDTPPIGIVIDAAII 
45 TQKCDASILVTATGEANKRD1QKAKQQLKQTGKLFLGVVLNKLDISVNKYGVYGSYGNYGKK 

ID13 1182bo 

(SEQ ID NO: 151) 

50 atgg aggcaaatatgaaacatctaaaaacattttacaaaaaatggtttcaattattagtcgttatcgtcattagctt 
ttttagtggagccttgggtagtttttcaataactcaactaactcaaaaaagtagtgtaaacaactctaacaacaat 
agtactattacacaaactgcctataagaacgaaaattcaacaacacaggctgttaacaaagtaaaagatgctgtt 
gtttctgttattacttattcggcaaacagacaaaatagcgtatttggcaatgatgatactgacacagattctcagcg 
aatctctagtgaaggatctggagttatttataaaaagaatgataaagaagcttacatcgtcaccaacaatcacgtt 

55 attaatggcgccagcaaagtagatattcgattgtcagatgggactaaagtacctggagaaattgtcggagctgac 
actttctctgatattgctgtcgtcaaaatctcttcagaaaaagtgacaacagtack:tgagtttggtgattctagtaa 
gttaactgtaggagaaactgctattgccatcggtagcccgttaggttctgaatatgcaaatactgtcactcaaggt 
atcgtatccagtctcaatagaaatgtatccttaaaatcggaagatggacaackztatttctacaaaagccatccaaa 
ctgatactgctattaacccaggtaactctggcggcccactgatcaatattcaagggcaggttatcggaattacctc 

60 aagtaaaattck:tacaaatggaggaacatctgtagaaggtcttggtttcgcaattcctgcaaatgatgctatcaat 
att attgaac agtt agaaaaaaacggaaaagtgacgcgtccagctttgggaatccagatggttaatttatcta atg 
tgagtacaagcgacatcagaagactcaatattccaagtaatgttacatctggtgtaattgttcgttcggtacaaag 
taatatgcctgccaatggtcaccttgaaaaatacgatgtaattacaaaagtagatgacaaagagattgcttcatca 
acagacttacaaagtgctctttacaaccattctatcggagacaccattaagataacctactatcgtaacgggaaag 

65 aagaaactacctctatcaaacttaacaagagttcaggtgatttagaatcttaa 
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(SEQ ID NO: 152) 

MEANMKHLKTFYKKWFQLLVVIVISFFSGALGSFS 

ITYSANRQNSVFGNDDTDTDSQRISSEGSGVIYKKiNlDKEAYIVTNNHVINGASKVDrRLSDGTKVPGEIVGADTFSDIAVV 
KISSEKVTTVAEFGDSSKLTVGETAIAIGSPLGSEYANTVTQGIVSSLNRNVSLKSEDGQAISTKAIQTDTAINPGNSGGPLI 
5 NIQGQVIGITSSK1ATNGGTSVEGLGFAIPANDAINIIEQLEKNGKVTRPALGIQMVNLSNVSTSDIRRLNIPSNVTSGVIVR 
SVQSNMPANGHLEKYDVITKVDDICEIASSTDLQSALYNHSIGDTIKITYYRNGKEETTSIKLNKSSGDLES 

ID15 939bp 

10 (SEQ fD NO: 1 53) 

atggcagaaatttatctagcaggtggttgtttttgggtc 

aaccagtgttggctacgctaatggtcaagtcgaaacgaccaattaccagttgctcaaggaaacagaccatgcaga 

aacggtccaagtgatttacgatgagaaggaagtgtcactcagagagattttactttattatttccgagttatcgatc 

ctctatctatcaatcaacaagggaatgaccgtggtcgccaatatcgaactgggatttattatcaggatgaagcaga 

15 tttgccagctatctacacagtggtgcaggagcaggaacgcatgctgggtcgaaagattgcagtagaagtggagca 
attacgccactacattctggctgaagactaccaccaagactatctcaggaagaatccttcaggttactgtcatatc 
gatgtgaccgatgctgataagccattgattgatgcagcaaactatgaaaagcctagtcaagaggtgttgaaggcc 
agtctatctgaagagtcttatcgtgtcacacaagaack:tgctacagagck:tccatttaccaatgcctatgaccaaa 
cctttgaagaggggatttatgtagatattacgacaggtgagccactcttttttgccaaggataagtttgcttcaggt 

20 tgtggttggccaagttttagccgtccgatttccaaagagttgattcattattacaaggatctgagccatggaatgg 
agcgaattgaagttcgttctcgttcaggcagtgctcacttgggtcatgttttcacagatggaccgcgggagttagg 
cckjcctccgttactgtatcaattctck:ttctttacgctttgtggccaaggatgagatggaaaaagcaggatatgc^ 
atctattgccttacttaaacaaataa 

25 (SEQ ID NO: 154) 

maeiyijvgck:fwgi^eyfsrisgvl^tsvgyangqvettnyqllket^ 

qqgndrgrqyrtgiyyqdeadlpaiytvvqeqermlgrkiaveveqlrhyilaedyhqdylrknpsgychedvtdadk 
plidaanyekpsqevlkasi^eesyrv^qeaateapftoayix^tfeegiyvdittgeplffakdkfasgcgwpsfsrpis^ 
lihyykdlshgmertevrsrsgsahlghvftdgprelgglrycinsaslrfvakdemekagygyllpylnk 

30 

ID17 870bp 

(SEQ ID NO: 155) 

atgaagattattgtacctgcaaccagtgccaatatcggck:caggttttgactcggtcggtgtagctgtaaccaagt 
35 atcttcaaattgaggtctck:gaagaacgagatgagtggctgattgaacaccagattggcaaatggattccacatga 
cgagcgtaatctcttgctcaaaatcgctttcm^aaattgtaccagacttgcaaccaagacgcttgaaaatgaccagt 
gatgtccctttggcgcgcck3tttgggttcttccagctcggttatcgttgctgggattgaactagccaa 
tcaactcaacttatcagaccatgaaaaattgcagttagcgaccaagattgaagggcatcctgacaatgtggctcca 
ck:catttatggtaatctcgttattgcaagttctgttgaagggcaagtctctck^tatcgtagcagactttccagagtg 
40 tgattttctactcttacattccaaactatgaattacgtactcgcgacagccgtagtgtcttgcctaaaaaattgtctt 
ataaggaagctgttgctgcaagttctatcgccaatgtagcggttgctgccttgttggcaggagacatggtgaccgc 
tgggcaack:aatcgagggagacctcttccatgagcgctatcgtcaggacttggtaagagaatttgcgatgattaag 
c a agtg acc a a ag a a a atggggcct atgc a acct acctttctggtgctgggcc g ac agtt atggttctggcttctc 
atgacaagatc^caacaattaackx:agaattggaaaagcaacctttcaaaggaaaactgcatgacttgagagttg 
45 atacccaaggtgtccgtgtagaagcaaaataa 

(SEQ ID NO: 156) 

MKIIVPATSANIGPGFDSVGVAVTKYLQIEVCEERDEWLIEHQIGKWIPHDERNLLLKIALQIVPDLQPRRLKMTSDVPLA 
RGLGSSSSVIVAGIELANQLGQLNLSDHEKLQLATKIEGHPDNVAPAIYGNLVIASSVEGQVSAIVADFPECDFLAYIPNY 
50 ELRTRDSRSVLPKKLSYKEAVAASSIANVAVAALLAGDMVTAGQAIEGDLFHERYRQDLVREFAMIKQVTKENGAYAT 
YLSGAGPTVMVLASHDKMPTIKAELEKQPFKGKLHDLRVDTQGVRVEAK 

ID20 564bp 

55 (SEQ ID NO: 157) 

atgaaatatcacgattacatctgggatttaggtggaactttactggataattatgaaacttcaacagctgcatttgt 
tgaaacattggcactgtatggtatcacacaagaccatgacagtgtctatcaagctttaaaggtttctactccttttg 
cgattg agac attcgctcccaatttagagaattttttagaaaagtacaaggaaaatgaagccagagagcttgaac 
acccgattttatttga agga gtttctgacctattggaagacatttcaa^^^ 
60 catcgaaatgatcaggttttggaaattttagaaaaaacctctat 

gctcaggctttaagagaaagccaaatcccgaatccatgctttatttaagagaaaagtatcagattagctctggtct 
tgtcattggtgatcggccgattgatatcgaack:aggtcaagctgcaggacttgatacccacttgtttaccagtatc 
gtgaatttaagacaagtattagacatataa 

65 (SEQ ID NO: 1 58) 
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MKYHDYIWDLGGTLLDNYETSTAAFVETLALYGITQDHDSVYQALKVSTPFAETFAPNLENFLEKYKENEARELEHPIL 
FEGVSDLLEDISNQGGRHFLVSHRNDQVl^ll^KTSIAAYFTEVVTSSSGFKJlKJ'NPESMLYLREKYQISSGLVIGDRPtDIE 
AGQAAGLDTHLFTSIVNLRQVLDI 

IP21 1875bp 

(SEQ rDNO: 159) 

atgacagaagaaatcaaaaatctgcaggcacackjattatgatgccagtcaaattcaagttttagagggcttagag 

gctgttcgtatgcgtccagggatgtacattggatcaacctcaaaagaaggtcttcaccatctagtctgggaaattg 

ttgataactcaattgacgaggccttgck:aggatttgccagccatattcaagtttttattgagccagatgatt 

actgttgtggatgatgggcgtggtatcccagtcgatattcaggaaaaaacaggccgtcctgctgttgagaccgtct 

ttacagtccttcacgctggaggaaagttcggcggtggtggatacaaggtttcaggtggtcttcacggggtggggtc 

gtcagtagttaatck:cctttccactcaattagacgttcatgttcacaaaaatggtaagattcattaccaagaatacc 

gtcgtggtcatgttgtcgcagatcttgaaatagttggagatacggataaaacaggaacaactgttcacttcacacc 

ggacccaaaaatcttcactgaaacaacaatctttgattttgataaattaaataaacggattcaagagttggccttt 

ctaaatcgcggtcttcaaatttcaattacagataagcgccaaggtttggaacaaaccaagcattatcattatgaag 

gtgggattgctagttacgttgaatatatcaacgagaacaaggatgtaatctttgatacaccaatctatacagacgg 

tg ag atgg atg at atc ac agttg aggt agcc atgc agt ac ac a actggtt acc atg a a a atgtc atg agtttcgcc 

aataatattcatacccatgaaggtggaacacatgaacaaggtttccgtacagccttgacacgtgttatcaacgatt 

atgctcgtaaaaataagttactgaaagacaatgaagataatttaacaggggaagatgttcgcgaaggcttaactg 

cagttatctcagttaaacacccaaatccacagtttgaaggacaaaccaagaccaaattgggaaatagcgaagtgg 

tcaagattaccaatcgcctcttcagtgaagctttctccgatttcctcatggaaaatccacagattgccaaacgtatc 

gtagaaaaaggaattttggctgccaaggctcgtgtggctgccaagcgtgcgcgtgaagtcacacgtaaaaaatct 

ggtttggaaatttccaaccttccagggaaactagcagactgttcttctaataaccctgctgaaacagaactcttcat 

cgtcgaaggagactcagctggtggatcagccaaatctggtcgtaaccgtgagtttcaggctatccttccaattcgc 

ggtaagattttgaacgttgaaaaagcaagtatggataagattctagccaacgaagaaattcgtagtcttttcacag 

ccatgggaacaggatttggcgcagaatttgatgtttcgaaagcccgttaccaaaaactcgttttgatgaccgatgc 

cgatgtcgatggagcccacattcgtacccttcttttaaccttgatttatcgttatatgaaaccaatcctagaack^tg 

gttatgtttatattgcccaaccaccaatctatggtgtcaaggttggaagcgagattaaagaatatatccagccggg 

tck:agatcaagaaatcaaactccaagaagctttagcccgttatagtgaaggtcgtaccaaaccgactattcagcgt 

tataaggggctaggtgaaatggacgatcatcagctgtgggaaacaaccatggatcccgaacatcgcttgatggct 

agagtttctgtagatgatgtgcagaagcagataaaatctttgatatgttga 

(SEQ ID NO: 160) 

MTEE1KJSILQAQDYDASQIQVLEGLEAVRMRPGMYIGSTSKEGLHHLVWEIVDNSIDEALAGFASHIQVFIEPDDSITVVD 
DGRGIPVDIQEKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVHVHKNGKIHYQEYRRGHV 
VADL£IVGDTDKTGTTVHFTPDPKIFTETTIFDFDKIJ^K^^ 
ENKDVIFDTPIYTDGEMDDITVEVAMQYTTGYHENVMSFAN 

NLTGEDVREGLTAVISVKHPNPQFEGQTKTKLGNSEVVKITNRLFSEAFSDFLMENPQIAKRIVEKGILAAKARVAAKRA 
REVTRKKSGLEISNLPGKLADCSSNNPAETELFIVEGDSAGGSAKSGRNREFQA1LPIRGKILNVEKASMDKILANEEIRSL 
FTAMGTGFGAETOVSKj\RYQK1 j VLMTDADVDGAH[RTLLLTLIYRYMKPILEAGYVYIAQPPIYGVKVGSEIKEYIQPGA 
DQEIK1.QEALARYSEGRTKPTIQRYKGLGEMDDHQLWETTMDPEHRLMARVSVDDVQKQIKSLIC 

ID54 1446bp 

(SEQ ID NO: 161) 

atgagtagacgttttaaaaaatcacgttcacagaaagtgaagcgaagtgttaatatagttttgctgactatttattt 

attgtt agtttg i" i ' i " i'l " i ' attgttctt a atcttt a agt ac a at atccttgctttt ag at atctt a atct agtggt a act 

gcgttagtcctactagttgccttggtagggctactcttgattatctataaaaaagctgaaaagtttactattrttct 

gttggtgttctctatccttgtcagctctgtgtcgctctttgcagtacack:agtttgttggactgaccaatcgttt 

tgcgacttctaattactcagaatattcaatcagtgtcgctgttttagcagatagtgagatcgaaaatgttacgcaac 

tgacgagtgtgacagcaccgactgggactaataatgaaaatattcagaaattactagctgatatcaagtcaagtca 

gaataccgatttgacggtcaaccagagttcgtcttacttggcagcttacaagagtttgattgcaggggagactaag 

gccattgtcctaaatagtgtctttgaaaacatcatcgagtcagagtatccagactacgcatcgaagataaaaaaga 

tttatactaagggattcactaaaaaagtagaagctcctaagacgtctaagagtcagtctttcaatatctatgttagt 

ggaattgacacctatggtcctattagttcggtgtcgcgatcagatgtcaacatcctgatgactgtcaatcgagata 

ccaagaaaatcctcttgaccacaacgccacgtgatgcctatgtaccaatcgcagatggtggaaataatcaaaaag 

ataaattgactcatgcgggcatttatggagttgattcgtccattcacaccttagaaaatctctatggagtggatatc 

aattactatgtgcgattgaacttcacttcgtttttgaaattgattc 

caagaatttactck:ccatacgaatggaaagtattaccctck:aggcaatgttcatcttgattcagaacaggctctcg 

gttttgttcgtgagcgctactccctagcagatggcgatcgtgaccgcgggcgccatcaacaaaaggtgattgtggc 

tatccttcaaaaattaacgtcaaccgaagtgctgaaaaattatagtacgatcattaatagcttgcaagattctatc 

caaacaaatatgccacttgagaccatgataaatttggtcaatgctcagttagaaagtggagggaattataaagta 

aattctcaagatttaaaagggacaggtcggatggatcttccttcttatgcaatgccagacagtaacctctatgtga 

tggaaatagatgatagtagtttagctgtagttaaagcagctatacaggatgtgatggagggtagatga 
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(SEQ ID NO: 162) 

MSRRFKKSRSQKVl^SVNFVLLTIYLLLVCFIX^ 
LVSSVSI^AVQQFVGLTNRLJ^ATSNYSEYSISVAVl^DS 

SSYLj\AYKSLIAGETKAIVLNSVFENIIESEYPDYASKIKKIYTKGFTKKVEAPKTSKSQSFN1YVSGIDTYGPISSVSRSDV 
NIIJVITVNRDTKKILLTTTPRDAYVPIADGGNNQKDKIT^ 

DVYNIDQEFTAHTNGKYYPAGNVHLDSEQALGFVRERYSLADGDRDRGRHQQKVIVArLQKLTSTEVLKNYSTIINSLQD 
SIQTNMPLETM INTLVNAQLESGGNYKVNSQDLKGTGRMDLPSYAMPDSNLYVMEIDDSSLAVVKAAIQDVMEGR 

1D55 732bp 

(SEQ ID NO: 163) 

ATGATAGACATCCATTCGCATATCGTTTTTGATGTAGATGACGGTCCCAAGTCAAGAGAGGAAAGCAAGGCTCTCT 

TGGCAGAATCCTACAGACAGGGGGTGCGAACCATTGTTTCTACCTCTCACCGTCGCAAGGGCATGTTTGAAACTCC 

GGAAGAGAAGATAGCAGAAAACTTTCTTCAGGTTCGGGAAATAGCTAAGGAAGTGGCGAGTGACTTGGTCATTGC 

TTACGGGGCTGAAATTTATTACACACCAGATGTTCTGGATAAGCTGGAAAAAAAGCGGATTCCGACCCTCAATGAT 

AGTCGTT ATGCCTTG AT AG AGTTT AGT ATG A AC ACTCCTT ATCGCG AT ATTC AT AGCGCCTTG AGC A AG ATCTTG AT 

GTTGGGAATTACTCCAGTCATTGCCCACATTGAGCGCTATGATGCTCTTGAAAATAATGAAAAACGCGTTCGAGAA 

CTGATCGATATGGGCTGTTACACGCAAGTAAATAGTTCACATGTCCTCAAACCCAAACTTTTTGGCGAACGTTATA 

AATTCATGAAAAAAAGAGCTCAGTATTTTITAGACCAGGATTTGGTTCATGTCATTGCAAGTGATATGCACAATCT 

AGACGGTAGACCTCCTCATATGGCAGAAGCATATGACCTTGTTACCCAAAAATACGGAGAAGCGAAGGCTCAGGA 

ACTTTTTATAGACAATCCTCGAAAAATTGTAATGGATCAACTAATTTAG 

(SEQ ID NO: 164) 

MIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVASDLVIAYGAEI 
YYTPDVLDKl£KKRIPTLNDSRYALIEFSMNTPYRDIHSALSKILMLGITPVIAHIERYDAl^NNEKJiVRELIDMGCYTQV 
NSSHVLKPKLFGERYKFMKKJIAQYFLEQDLVHVIASDMHNLDGRPPHMAEAYDLVTQKYGEAKAQELFIDNPRKIVM 
DQLI 

ID58 3990bp 

(SEQ ID NO: 165) 

ttgatttatataatcgctatcaatataacaatgcaatcaggaggttttgcaatgaaacatgaaaaacaacagcgtt 

tttctattcgtaaatacck:tgtaggack:agcttctgttctaattggatttgccttccaagcacagactgttgcagcc 

gatggagttactcctactactacagaaaaccaaccgaccatccatacggtttctgattcccctcaatcatccgaaa 

atcggactgaggaaacacctaaagcagtgcttcaaccagaagctccaaaaactgtagaaacagaaactccagcta 

ctgataaggtagctagtcttccaaaaacagaagaaaaaccacaagaggaagttagttcaactcctagtgataaag 

cagaagtggtaactccaacttctgctgaaaaagaaactgctaataaaaaggcagaagaagctagccctaaaaagg 

aagaagcgaaagaggttgattctaaagagtcaaatacagacaagactgacaaggataaaccagctaaaaaagat 

gaagcgaaagcagaggctgacaaaccggcaacagaggcaggaaaggaacgtgctgcaactgtaaatgaaaaact 

agcgaaaaagaaaattgtttctattgatgctggacgtaaatatttctcaccagaacagctcaaggaaatcatcgat 

aaagcgaaacattatggctacactgatttacacctattagtcggaaatgatggactccgtttcatgttggacgata 

tgagcatcacagctaacggcaagacctatgccagtgacgatgtcaaacgcgccattgaaaaaggtacaaatgatt 

attacaacgatccaaacggcaatcacttaacagaaagtcaaatgacagatctgattaactatgccaaagataaag 

gtatcggtctcattccgacagtaaatagtcctggacacatggatgcgattctcaatgccatgaaagaattgggaat 

ccaaaaccctaactttagctattttgggaagaaatcagcccgtactgtcgatcttgacaacgaacaagctgtcgct 

tttacaaaagcccttatcgacaagtatgctgcttatttcc^gaaaaagactgaaatcttcaacatcggacttgat 

aatatgccaatgatgcgacagatgctaaaggttggagtgtgcttcaagctgataaatactatccaaacgaaggcta 

ccctgtaaaaggctatgaaaaatttattgcctacgccaatgacctcgctcgtattgtaaaatcgcacggtctcaaa 

ccaatgck:ttttaacgacggtatctactacaatagcgacacaagctttggtagttttgacaaagacatcatcgtttc 

t atgtgg actggtggttgggg aggct acg atgtcgcttcttct a a act act agctg aa a a aggtc a cc a a atcctt 

aataccaatgatgcttggtactacgttcttggacgaaacgctgatggccaaggctggtacaatctcgatcaggggc 

tcaatggtattaaaaacacaccaatcacttctgtaccaaaaacagaaggagctgatatcccaatcatcggtggtat 

ggtagctck:ttggck:tgacactccatctgcacgttattcaccatcacgcctcttcaaactcatgcgtcattttgcaa 

atgccaacgctgaatacttcgcagctgattatgaatctgcagagcaagcacttaacgaggtaccaaaagacctga 

accgttatactgcagaaagcgtcacgck:cgtaaaagaagctgaaaaagctattcgctctctcgatagcaaccttag 

ccgtgcccaacaagatacgattgatcaagccattgctaaacttcaagaaactgtcaacaacttgaccctcacgcct 

gaagctcaaaaagaagaagaagctaaacgtgaggttgaaaaacttgccaaaaacaaggtaatctcaatcgatgct 

ggacgcaaatactttactctgaaccagctcaaacgcatcgtagacaaggccagtgagctcggatattctgatgtcc 

atctccitctaggaaatgacggacttcck:tttctactcgatgatatgaccattactgccaacggaaaaacctatgct 

agtgatgacgttaaaaaagctattatcgaaggaactaaagcttactacgacgatccaaacggtactgcactaaca 

caggcagaagtaacagagctaattgaatacgctaaatctaaggacatcggtctcatcccagctattaacagtccag 

gtcacatggatgctatgctggttgccatggaaaaattaggtattaaaaatcctcaagcccactttgataaagtttc 

aaaaacaactatggacttgaaaaacgaagaagcgatgaactttgtaaaagccctcatcggtaaatacatggactt 

ctttgcaggtaaaacaaagattttcaactttggtactgacgaatacgccaacgatgcgactagtgcccaaggctgg 

tactacctcaagtggtatcaactctatggcaaatttgccgaatatgccaacaccctcgcagctatggccaaagaaa 

gagggcttcaaccaatggccttcaacgatgch^ttctactatgaagacaaggacgatgttcagtttgacaaagatgt 
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CTTG ATTTCTT ACTGGTCT AA AGCCTGGTGGGG AT AT A ACCTCGC ATC ACCTC A AT ACCT AGC A AGC A A A GGCT AT 

AAATTCTTGAATACCAACGGTGACTGGTACTACATTCITGGTCAAAAACCAGAAGATGGTGGTGGrrrCCTCAAGA 

AAGCTATTGAGAATACTGGAAAAACACCATTCAATCAACTAGCTTCTACCAAATATCCTGAAGTAGATCTTCCAAC 

AGTCGGAAGTATGCTTTCAATCTCKjGCAGATAGACCAAGCGCTGAATACAAGGAAGAGGAAATCTTTGAACTCAT 

G ACTGCCTTTGC AG ACC AC A AC A A AG ACT ACrTTTCCTGCT A ATT AT A ATCKn"CTCCGCG A AG A ATT ACKTT A A A ATT 

CCTACAAACTTAGAAGGATATAGTAAAGAAAGTCTTGAGCKIlCCTTGACGCAGCTAAAACAGCrCTAAATTACAACC 

TC A ACCGT A AT A A AC A AGCTG AGCHTG AC ACGCTTTGT AGCC A ACCH" A A A AGCCGCTCITC A AGGCCTC A A ACC AGC 

TGTAACTCATTCAGGAAGCCTAGATGAAAATGAAGTGGCTGCCAATGTTGAAACCAGACCAGAACTCATCACAAG 

AACTGAAGAAATTCCATTTGAAGTTATCAAGAAAGAAAATCCTAACCTCCCAGCCGGTCAGGAAAATATTATCACA 

GCAGGAGTCAAAGGTGAACGAACTCATTACATCTCTGTACTCACTGAAAATGGAAAAACAACAGAAACAGTCCTT 

GATAGCCAGGTAACCAAAGAAGTTATAAACCAAGTGGTTGAAGTTGGCGCTCCTGTAACTCACAAGGGTGATGAA 

AGTGGTCTTGCACCAACTACTGACK3TAAAACCTAGACTGGATATCCAAGAAGAAGAAATTCCATTTACCACAGTGA 

CTTGTGAAAATCCACTCTTACTCAAAGGAAAAACACAAGTCATTACTAAGGGCGTCAATGGACATCGTAGCAACTT 

CTACTCTGTGAGCACTTCTGCCGATGGTAAGGAAGTGAAAACACTTGTAAATAGTGTCGTAGCACAGGAAGCCGTT 

ACTCAAATAGTCGAAGTCGGAACTATGGTAACACATGTAGGCGATGAAAACGGACAAGCCGCTATTGCTGAAGAA 

AAACCAAAACTAGAAATCCCAAGCCAACCAGCTCCATCAACTGCTCCTGCTGAGGAAAGCAAAGTTCTTCCTCAAG 

ATCCAGCTCCTGTGGTAACAGAGAAAAAACTTCCTGAAACAGGAACTCACGATTCTGCAGGACTAGTAGTCGCAG 

GACTCATGTCCACACTAGCAGCCTATGGACTCACTAAAAGAAAAGAAGACTAA 

(SEQ ID NO: 166) 

MIYIlAINITMQSGGFAMKHEKQQRFSmKYAVGAASVLIGFAFQAQTVAADGVTPTTTENQPTIHTVSDSPQSSENRTEE 

TPKAVLQPEAPKTVETETPATDKVASLPKTEEKPQEEVSSTPSDKAEVVTPTSAEKETANKKAEEASPKKEEAKEVDSKE 

S NTD KTD KD KP A K KDE A K AE AD KP ATE AG KER A AT V N E KL A KKKIVSIDAGRKYFS PEQ L KE I [D KA JCH YG YTD LH LL 

VGNDGLRFMLDDMSITANGKTYASDDVKRAtEKGTNDYYNDPNGNHLTESQMTDLlNYAKDKGIGLIPTVNSPGHMDA 

ILNAMKELGIQNPNFSYFGKKSARTVDLDNEQAVAFTKALIDKYAAYFAKKTEIFNIGLDEYANDATDAKGWSVLQAD 

KYYPNEGYPVKGYEKFIAYANDLARIVKSHGLKPMAFNDGIYYNSDTSFGSFDKDirVSMWTGGWGGYDVASSKLLAE 

KGHQILNTNDAWYYVLGRNADGQGXWNLDQGLNGIKNTPITSVPKTC^ 

RHFANANAEYFAADYESAEQALNEVPKDLNRYTAESVTAVKEAEKAIRSLDSNLSRAQQDTrDQAIAKLQETVNNLTLT 

PEAQK£EEAKREVEKI-AK>IKV[SIDAGRKYFTLNQLKRIVDKASELGYSDVHLLLGNDGLRFLLDDMTITANGKTYASD 

DVKKAIIEGTKAYYDDPNGTALTQAEVTELIEYAKSKDIGLIPAlNSPGHMDAMLVAMEKI-GIKJvfPQAHFDKVSKTTMD 

LKJ^EEAMNFVKALIGKYMDFFAGKTKIFNFGTDEYANDATSAQG\\^YLKWQLYGKFAEYANTLAAMAKERGLQPM 

AFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYNLASPQYLASKGYKFLNTNGDWYYILGQKPEDGGGFLKKAIENT 

GKTPFNQLASTKYPEVDLPTVGSMl^IWADRPSAEYKEEErFELMTAFADHNKDYFRANYNALREELAKIPTNLEGYSK 

ESLEALDAAKTALNYNLNRNKQAELDTLVANLKAALQGLKPAVTHSGSLDENEVAANVETRPELITRTEEIPFEVIKKEN 

PNLPAGQENIITAGVKGERTHYISVLTENGKTTETVLDSQVTKEVINQVVEVGAPVTHKGDESGLAPTTEVKPRLDIQEE 

EEPFTTVTCENPLLLKGKTQVITKGVNGHRSNFYSVSTSADGKEVKTLVNSVVAQEAVTQIVEVGTMVTHVGDENGQA 

AIAEEKPKLE1PSQPAPSTAPAEESKVLPQDPAPVVTEKKLPETGTHDSAGLVVAGLMSTLAAYGLTKRKED 

ID122 825bp 

(SEQ ID NO: 167) 

atg a ac aa a a a a ac a ag ac ag ac act a atc gg actgct agtgtt attgcttttgtct ac aggg agct att at atc a 

a gc ag atgccgtcggc acct a at agtccc a a a acc a atctt agtc ag a a a a a ac a agcgtctg a agctcct agtc a 

agcattggcagagagtgtcttaacagacgcagtcaagagtcaaataaaggggagtctggagtggaatggctcagg 

tck:ttttatcgtcaatggtaataaaacaaatctagatgccaaggtttcaagtaagccctacgctgacaataaaaca 

aagacagtgggcaaggaaactgttccaaccgtagctaatgccctcttgtctaaggccactcgtcagtacaagaatc 

gtaaagaaactgggaatggttcaacttcttggactcctccaggttggcatcaggtcaagaatctaaagggctctta 

tacccatgcagtcgatagaggtcatttgttagck^tatgccttaatcggtggtttggatggttttgatgcctcaacaa 

gcaatcctaaaaacattgctgttcagacagcctgggcaaatcaggcacaagccgagtattcgactggtcaaaacta 

ctatgaaagcaaggtgcgtaaagccttggaccaaaacaagcgtgtccgttaccgtgtaaccctttactacgcttca 

aacgaggatttagttccctcagcttcacagattgaagccaagtcttcggatggagaattggaattcaatgttctag 

ttccc a atgttc a a a aggg acttc a actgg att accg a actgg a g a agt a actgt a actc agt a a 

(SEQ ID NO: 168) 

MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPSQALAESVLTDAVKSQIKGSLEWNGSGAFIV 
NGNKTNLDAKVSSKPYADNKTKTVGK£TVPWANAL1^KATRQYKNRK£TGNGSTSWTPPGWHQVKNLKGSYTHAV 
DRGHLLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTGQNYYESKVRKALDQNKRVRYRVTLYYASNEDLV 
PSASQIEAKSSDGELEFNVLVPNVQKGLQLDYRTGEVTVTQ 

ID123 225bo 

(SEQ ID NO: 169) 

GTGCTAAGATTCAGCGGATTGAGGCAAGTGATGAAGATGAATAAGAAATCAAGCTACGTAGTCAAGCGTTTACTTT 
TAGTCATCATAGTACTGATTTTAGGTACTCTGGCTCTAGGAATCGGTTTAATGGTAGGTTATGGAATCTTGGGCAAG 
GGTCAAGATCCATGGGCTATCCTGTCTCCAGCAAAATGGCAGGAATTGATTCATAAATTTACAGGAAATTAG 
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(SEQ [D NO: 170) 

VU^FSGLRQVMKMNKKSSYVVKRLLLVIIVLILGTI^LGIGU^VCYGIU3KGQDPWAILSPAKWQELIHKJ=TGN 



5 



t 



